Abstract

We consider differential equations driven by rough paths and study the regularity of 
the laws and their long time behavior. In particular, we focus on the case when the 
driving noise is a rough path valued fractional Brownian motion with Hurst parameter 
$H \in (\frac13, \frac12)$. Our contribution in this work is twofold.

First, when the driving vector fields satisfy Hörmander's celebrated "Lie bracket
condition", we derive explicit quantitative bounds on the inverse of the Malliavin ma- 
trix. En route to this, we provide a novel "deterministic" version of Norris's lemma for 
differential equations driven by rough paths. This result, with the added assumption 
that the linearized equation has moments, will then yield that the transition laws have 
a smooth density with respect to Lebesgue measure. 

Our second main result states that under Hörmander's condition, the solutions to
rough differential equations driven by fractional Brownian motion with $H \in (\frac13, \frac12)$
enjoy a suitable version of the strong Feller property. Under a standard controllability 
condition, this implies that they admit a unique stationary solution that is physical in 
the sense that it does not "look into the future". 

1 Introduction 

In this article, we consider stochastic differential equations of the form 

$$dZ_t = V_0(Z_t)\,dt + \sum_{i=1}^d V_i(Z_t)\,dX_t^i, \qquad Z_0 = z_0 \in \mathbb{R}^n, \tag{1.1}$$

where $X_t$ is a $d$-dimensional random rough path [Lyo98, LCL07, FV10b] and $V_0, V_i$
are smooth vector fields on $\mathbb{R}^n$. While a large part of our work is deterministic and applies
to a large class of rough differential equations driven by rough paths that are Hölder
continuous with index greater than $\frac13$, our probabilistic results focus on the case when
$X_t$ is a two-sided $d$-dimensional fractional Brownian motion with Hurst parameter
$H \in (\frac13, \frac12)$. Recall that the fractional Brownian motion with Hurst parameter $H$ is the
centred Gaussian process such that $X_0 = 0$ and

$$\mathbb{E}|X_t - X_s|^2 = |t-s|^{2H}.$$
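
As a small numerical illustration of this definition, the following sketch builds the covariance of a scalar fractional Brownian motion and draws one sample path by a Cholesky factorisation. It is only a sketch under stated assumptions: the function names, the grid and the value $H = 0.4$ are illustrative choices that do not come from the article, and the Cholesky approach is one simple sampling method among several.

```python
import math
import random


def fbm_cov(s, t, hurst):
    """Covariance E[X_s X_t] of a scalar fBm with X_0 = 0."""
    return 0.5 * (abs(s) ** (2 * hurst) + abs(t) ** (2 * hurst)
                  - abs(t - s) ** (2 * hurst))


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            acc = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(acc) if i == j else acc / low[j][j]
    return low


def sample_fbm(times, hurst, rng):
    """Draw one path on strictly positive times (illustrative, O(n^3))."""
    cov = [[fbm_cov(s, t, hurst) for t in times] for s in times]
    low = cholesky(cov)
    noise = [rng.gauss(0.0, 1.0) for _ in times]
    return [sum(low[i][k] * noise[k] for k in range(i + 1))
            for i in range(len(times))]


HURST = 0.4  # assumed value in (1/3, 1/2)
TIMES = [k / 20 for k in range(1, 21)]
s, t = TIMES[3], TIMES[15]
# Variance of the increment X_t - X_s, computed from the covariance alone.
increment_variance = (fbm_cov(s, s, HURST) + fbm_cov(t, t, HURST)
                      - 2 * fbm_cov(s, t, HURST))
path = sample_fbm(TIMES, HURST, random.Random(0))
```

The quantity `increment_variance` reproduces $|t-s|^{2H}$ up to rounding, which is exactly the defining identity above.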

Differential equations driven by rough paths have been studied intensely in the past
decade and this theory has now reached a certain level of maturity; we refer to the
monographs [LQ02, LCL07, FV10b] for an overview of the theory. For driving signals
that are rougher than Brownian motion, the theory of rough paths has provided a
systematic way of constructing solutions to differential equations of the type (1.1) in 
a way that is "natural", in the sense that solutions are limits of approximate solutions 
where X is replaced by a smoothened version. 

When the noise $X_t$ is replaced by a standard Brownian motion $B_t$, it has been
well-known since the groundbreaking work of Hörmander [Hör67] that, for the laws of
the Markov process $Z_t$ to have a smooth transition density, it is sufficient that the Lie
algebra generated by $\{\partial_t + V_0, V_1, \dots, V_d\}$ spans $\mathbb{R}^{n+1}$ at every point (see Assumption 5.1
for a precise formulation). The formalism of Malliavin calculus was invented to give
a probabilistic proof of this result [Mal78, Mal97, Nua06, KS84, KS85, KS87]. The
smoothness of transition densities coupled with some mild controllability assumptions
will then yield that the system (1.1) has a unique invariant measure.

When the driving noise $X$ is a fractional Brownian motion with $H \neq \frac12$, solutions
to (1.1) are neither a Markov process nor a semimartingale, so standard tools from 
stochastic calculus break down. Inspired by the results in the case of Brownian motion, 
two natural questions in this context are to identify conditions under which 

1. the "transition densities" of (1.1) are smooth, 

2. the system (1.1) has a "unique invariant measure". 

Since Z is not Markov in general, it does not really make sense to speak of transition 
probabilities, but the first question still makes sense by, for example, considering the 
law of the solution at some time $t > 0$, given an initial condition $Z_0$, conditional on the
realisation of $\{X_s : s \le 0\}$. Similarly, the notion of an "invariant measure" does not
make immediate sense for non-Markovian processes. This problem has been discussed 
extensively in [Hai05, Hai09], where a notion of an invariant measure adapted to sys- 
tems of the type (1.1) is introduced. Essentially, these are stationary solutions to (1.1) 
that are "physical" in the sense that they are independent of the innovation of X. 

In recent years, the SDE (1.1) was studied when the driving noise $X$ is a fractional
Brownian motion with Hurst parameter $H > \frac12$. In this case, the answers to both of
the above questions are completely settled in a series of papers [Hai05, BH07, HO07,
NS09, HP11]. In particular, it was shown in [BH07, HP11] that the solutions to (1.1)
have smooth "transition densities" when the vector fields satisfy Hörmander's condition.
It was also shown that if furthermore the control system associated to (1.1) is
approximately controllable then, under suitable dissipativity and boundedness conditions
on the vector fields $V_i$, (1.1) also admits a unique invariant measure. However,
the question of smoothness of laws in the case of the driving noise $X$ being a fractional
Brownian motion with Hurst parameter $H < \frac12$ was completely open until now, despite
substantial recent progress in particular cases [Dri10, HT11]. The only general result
in the context of rough paths theory was obtained in [CF10], where the authors obtained
the existence of densities with respect to Lebesgue measure under Hörmander's
condition for a large class of driving noises.

In this paper we largely settle the above two questions when $X$ is a fractional Brownian
motion with $H \in (\frac13, \frac12)$. An important component underlying the probabilistic
proofs of the smoothness of transition densities is Norris's lemma [KS85, Nor86],
which roughly speaking states that if a semimartingale is small and if one has a priori
bounds on the regularity of its components, then its bounded variation part and the
local martingale part are also small. In this regard this lemma can be considered as a
quantitative version of the classical Doob-Meyer decomposition theorem. A version
of Norris's lemma for fractional Brownian motion with $H > \frac12$ was proved in [BH07].
The recent work [HT11], which appeared as the present article was nearing completion,
contains a version of Norris's lemma for $H \in (\frac13, \frac12)$ that is similar in spirit to the one
in [BH07].

Our contribution in this work is twofold. First, we prove a deterministic version
of Norris's lemma for general integrals against rough paths. This may sound strange
at first since Norris's lemma seems to be the prototype of a probabilistic statement and the
whole point of rough path theory is to get rid of stochastic calculus and replace it by
a deterministic theory. We reconcile these conflicting perspectives by first showing an
estimate strongly resembling that of Norris's lemma for processes of the form
$Z_t = \int_0^t A_s\,dX_s + \int_0^t B_s\,ds$, where $X$ is a rough path and $A$ is a rough path "controlled by
$X$" (see Section 2 below for precise definitions). This estimate makes use of a quantity
that we call the "modulus of Hölder roughness" of $X$, $L_\theta(X)$. See Definition 3.1 below
for the precise definition of $L_\theta$. In a second step, we then show that if $X$ is fractional
Brownian motion with $H < \frac12$, then $L_\theta(X)$ is almost surely positive for $\theta > H$ and
has inverse moments of all orders. A loose formulation of our main result is as follows
(see Theorem 3.4 and Lemma 3.9 below for precise formulations that include the exact
dependency of $M$ on $X$, $A$, and $B$):

Theorem 1.1 Let $X$ be a $\gamma$-Hölder continuous rough path in $\mathbb{R}^n$ with $\gamma > \frac13$, let $A$ be
a rough path in $\mathbb{R}^m$ controlled by $X$, let $B$ be a $\gamma$-Hölder continuous function, and set

$$Z_t = \int_0^t A_s\,dX_s + \int_0^t B_s\,ds. \tag{1.2}$$

Then, if $X$ is $\theta$-Hölder rough for some $\theta < 2\gamma$, there exist constants $r > 0$ and $q > 0$
such that one has the bound

$$\|A\|_\infty + \|B\|_\infty \le M L_\theta(X)^{-q} \|Z\|_\infty^r,$$

for a constant $M$ depending polynomially on the $\gamma$-Hölder "norms" of $X$, $A$, and $B$.
Here, $\|\cdot\|_\infty$ denotes the supremum norm over the interval $[0, 1]$.

Furthermore, if $X$ is the rough path canonically associated to fractional Brownian
motion with $H < \frac12$, then $\mathbb{E} L_\theta^{-p}(X) < \infty$ for every $\theta > H$ and every $p > 0$.

Remark 1.2 Note that this immediately tells us that if $X$ is Hölder rough, then it
admits a kind of Doob-Meyer decomposition in the sense that the processes $A$ and $B$
in (1.2) are uniquely determined by $Z$. An interesting fact is that Hölder roughness
is a deterministic property. In principle, one could imagine being able to check that 
this property holds almost surely for a number of driving noises, not even necessarily 
Gaussian ones. 

Combined with standard arguments, this result yields quantitative bounds on the 
inverse of the Malliavin matrix, thus obtaining a quantitative version of the result ob- 
tained in [CF10], where the authors showed via a 0-1 law argument that the Malliavin 
matrix is almost surely invertible. If we use the additional assumption that the lineariza- 
tion of the SDE (1.1) has moments of all orders, our results also yield that (1.1) has 
smooth densities thus extending the work pioneered by Malliavin to the case in which 
the driving noise is a rough path. When this work was nearing completion, we were 
notified of an independent work [HT11] showing smoothness of densities to solutions
to (1.1) in the case when the driving vector fields exhibit a "nilpotent" structure, which
allows one to obtain a priori bounds on the Malliavin derivatives of the solutions. The work
[HT11] also contains a version of Norris's lemma in the context of SDEs driven by
fractional Brownian motion with Hurst parameter $H \in (\frac13, \frac12)$.

In the second half of the paper, we show that under an additional controllability 
assumption, the SDE (1.1) has a unique invariant measure, which follows from the 
strong Feller property defined in [HO07]. Our main technical problem in this part is 
that we do not assume that the linearised equation has bounded moments. This leads 
us to use a cutoff argument similar to the one already used in [HO07]. If we denote by 
$\mathcal{A}_{t,z}$ the closure of the set of all points that are accessible at time $t$ for solutions to the
control problem associated to (1.1) starting at z, the second major result of this paper 
is the following: 

Theorem 1.3 Assume that the vector fields $\{V_i\}$ have derivatives with at most polynomial
growth and that (1.1) has global solutions. Then, if Hörmander's bracket condition
holds at every point, (1.1) is strong Feller.

In particular, if there exists $t > 0$ such that $\bigcap_{z \in \mathbb{R}^n} \mathcal{A}_{t,z} \neq \emptyset$, then (1.1) admits at
most one invariant measure in the sense of [Hai05].

The above result gives us the uniqueness of the invariant measure for the system 
(1.1). Theorem 1.3, combined with the assumption of the existence of an invariant 
measure, will yield that the system (1.1) is ergodic. 

The remainder of the article is structured as follows. In Section 2 we review the
framework of controlled rough paths from [Gub04], set up the notation and derive
some preliminary estimates. In Section 3, we then prove a general deterministic version
of Norris's lemma for SDEs driven by rough paths. Furthermore we show that our
assumptions are almost surely satisfied by the sample paths of fractional Brownian motion.
Section 4 is a rather technical section in which we show that solutions to (1.1) are
smooth in the sense of Malliavin calculus and obtain a priori bounds on their Malliavin
derivatives. We then obtain quantitative bounds on the lowest eigenvalue of the Malliavin
matrix in Section 5. Using the results in that section, we show that the existence of
moments of the derivative of the flow implies the smoothness of the transition densities.
In Section 6, we show the ergodicity of SDEs driven by fractional Brownian motion
under Hörmander's condition and a standard controllability assumption. In Section 7,
we finally give a few examples where our results are applicable.

2 Preliminaries

2.1 Notations 

Throughout this article, we will make use of the following notations. For quantities
$E$ and $R$, we write $E \le K(R)$ as a shorthand to mean that there exists a continuous
increasing function $b : \mathbb{R}_+ \to \mathbb{R}_+$ such that the bound $E \le b(R)$ holds. Note that the
function $b$ in question is unspecified and may change from one line to the next. We also
use the letter $M$ to denote an arbitrary (possibly problem-dependent) constant whose
precise value might vary from one line to the next.

2.2 Introduction to the theory of rough paths 

In this work, we adopt the framework of [Gub04] which offers a slightly different 
perspective on the pioneering work of Terry Lyons [Lyo98]. 

Denote by $\Omega\mathcal{C}$ the set of continuous functions from $\mathbb{R}^2$ to $\mathbb{R}$ which are $0$ on the
diagonal and define the "increment" operator $\delta : \mathcal{C} \to \Omega\mathcal{C}$ by:

$$\delta A_{st} = A_t - A_s. \tag{2.1}$$
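For a fixed final time $T > 0$ and a continuous function $f : [0, T] \to \mathbb{R}^n$, set

$$\|f\|_\infty = \sup_{t \in [0,T]} |f(t)|, \qquad \|f\|_\gamma = \sup_{s,t \in [0,T],\, s \neq t} \frac{|\delta f_{st}|}{|t-s|^\gamma}. \tag{2.2}$$

We also define the norm

$$\|f\|_{\mathcal{C}^\gamma} = \|f\|_\infty + \|f\|_\gamma.$$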
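
To make the quantities in (2.2) concrete, here is a minimal sketch that evaluates them for a scalar function sampled on a finite grid. The helper names and the test function $f(t) = \sqrt{t}$ are assumptions made for illustration; a grid computation only gives a lower bound on the true Hölder seminorm, since the supremum runs over grid points only.

```python
def sup_norm(values):
    """Discrete analogue of the supremum norm in (2.2)."""
    return max(abs(v) for v in values)


def holder_seminorm(times, values, gamma):
    """Largest ratio |f(t) - f(s)| / |t - s|^gamma over distinct grid points."""
    best = 0.0
    n = len(times)
    for i in range(n):
        for j in range(i + 1, n):
            ratio = abs(values[j] - values[i]) / abs(times[j] - times[i]) ** gamma
            best = max(best, ratio)
    return best


def holder_norm(times, values, gamma):
    """Sum of the supremum norm and the Hoelder seminorm."""
    return sup_norm(values) + holder_seminorm(times, values, gamma)


GRID = [k / 100 for k in range(101)]
SQRT_PATH = [t ** 0.5 for t in GRID]  # f(t) = sqrt(t) is exactly 1/2-Hoelder
```

For this example the seminorm with $\gamma = \frac12$ equals $1$ (attained at $s = 0$), so the full norm on $[0,1]$ equals $2$.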

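With these notations, a rough path on the interval $[0, T]$ consists of two parts, a
continuous function $X : [0, T] \to \mathbb{R}^d$ and a continuous "area process" $\mathbb{X} : [0, T]^2 \to
\mathbb{R}^{d \times d}$, $\mathbb{X} \in \Omega\mathcal{C}$, satisfying the algebraic identity:

$$\mathbb{X}^{ij}_{st} - \mathbb{X}^{ij}_{ut} - \mathbb{X}^{ij}_{su} = \delta X^i_{su}\, \delta X^j_{ut} \tag{2.3}$$

for all $s, u, t \in [0, T]$ and $1 \le i, j \le d$. For $\mathbb{X} \in \Omega\mathcal{C}$, define

$$\|\mathbb{X}\|_{2\gamma} := \sup_{s,t \in [0,T],\, s \neq t} \frac{|\mathbb{X}_{st}|}{|t-s|^{2\gamma}}. \tag{2.4}$$

For $\gamma \in (1/3, 1/2]$, we denote by $\mathcal{D}^\gamma([0, T], \mathbb{R}^d)$ the space of rough paths, consisting
of those pairs $(X, \mathbb{X})$ satisfying (2.3) and such that

$$\|(X, \mathbb{X})\|_\gamma := \|X\|_\gamma + \|\mathbb{X}\|_{2\gamma} < \infty.$$

Notice that $\|(X, \mathbb{X})\|_\gamma$ is only a semi-norm and that $\mathcal{D}^\gamma$ actually isn't a vector space,
due to the nonlinear constraint (2.3).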

For every smooth function $X : [0, T] \to \mathbb{R}^d$, there exists a canonical representative
in $\mathcal{D}^\gamma$ by choosing

$$\mathbb{X}_{s,t} = \int_s^t \delta X_{sr} \otimes dX_r.$$

We then denote by $\mathcal{D}_g^\gamma$ the closure of the set of smooth functions in $\mathcal{D}^\gamma$. (Here, $g$ stands
for "geometric".) The space $\mathcal{D}_g^\gamma$ has the nice feature of being a Polish space, which
will be useful in the sequel.

2.3 Controlled rough paths

For defining integrals with respect to rough paths, a useful notion introduced first in 
[Gub04] is that of "controlled" paths: 

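Definition 2.1 Let $(X, \mathbb{X}) \in \mathcal{D}^\gamma[0, T]$ for some $\gamma \in (1/3, 1/2]$. A pair $(Z, Z')$ is
said to be controlled by $X$ if $Z \in \mathcal{C}^\gamma([0, T], \mathbb{R}^n)$, $Z' \in \mathcal{C}^\gamma([0, T], \mathbb{R}^{n \times d})$, and the
"remainder" term $R^Z \in \Omega\mathcal{C}$ implicitly defined by

$$\delta Z_{st} = Z'_s\, \delta X_{st} + R^Z_{st}, \tag{2.5}$$

satisfies $\|R^Z\|_{2\gamma} < \infty$.

Denoting by $\mathcal{C}_X^\gamma$ the set of those paths that are weakly controlled by $X$, we endow
it with the norm

$$\|(Z, Z')\|_{X,\gamma} := |Z(0)| + \|Z'\|_{\mathcal{C}^\gamma} + \|R^Z\|_{2\gamma}.$$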

As noticed in [Gub04], now we may define the integral of a weakly controlled path
$(Z, Z') \in \mathcal{C}_X^\gamma$ with respect to a rough path $(X, \mathbb{X})$ by taking a limit of modified Riemann
sums:

$$\int_0^T Z_t\,dX_t = \lim_{|P| \to 0} \sum_{[s,t] \in P} \big(Z_s\, \delta X_{st} + \operatorname{tr}(Z'_s \mathbb{X}^T_{st})\big), \tag{2.6}$$

where $P$ is a finite partition of the interval $[0, T]$ into subintervals and $|P|$ denotes the
length of the largest subinterval. The following result, adapted from [Gub04, Proposition 1],
gives the continuity of the integral with respect to its integrand:

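Theorem 2.2 Let $(X, \mathbb{X}) \in \mathcal{D}^\gamma[0, T]$ for some $\gamma > 1/3$ and $(Y, Y') \in \mathcal{C}_X^\gamma$ be a weakly
controlled rough path. Then the map

$$(Y, Y') \mapsto (Z, Z') := \Big(\int_0^\cdot Y_t\,dX_t,\; Y \otimes I\Big),$$

where the integral is as defined in (2.6), is continuous from $\mathcal{C}_X^\gamma$ to $\mathcal{C}_X^\gamma$ and furthermore
we have the bound

$$\|R^Z\|_{2\gamma} \le M\big(\|X\|_\gamma \|R^Y\|_{2\gamma} + \|\mathbb{X}\|_{2\gamma} \|Y'\|_{\mathcal{C}^\gamma}\big). \tag{2.7}$$

Remark 2.3 Notice that from (2.7) we deduce that

$$\|(Z, Z')\|_{X,\gamma} \le M\big(\|X\|_\gamma (\|R^Y\|_{2\gamma} + \|Y\|_{\mathcal{C}^\gamma}) + \|\mathbb{X}\|_{2\gamma} \|Y'\|_{\mathcal{C}^\gamma}\big). \tag{2.8}$$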
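
The role of the second-order term in the modified Riemann sums (2.6) can be seen in a toy computation. The sketch below is an illustration only, under simplifying assumptions that are not part of the article: the driver is the smooth scalar path $X_t = t$ with its canonical area $\mathbb{X}_{s,t} = \frac12 (\delta X_{st})^2$, and the integrand is $Z = X^2$ with $Z' = 2X$. All function names are hypothetical.

```python
def compensated_sum(z, z_prime, x, area, n_steps, horizon=1.0):
    """Sum of Z_s * dX_{s,t} + Z'_s * area(s, t) over a uniform partition."""
    total = 0.0
    for k in range(n_steps):
        s = horizon * k / n_steps
        t = horizon * (k + 1) / n_steps
        total += z(s) * (x(t) - x(s)) + z_prime(s) * area(s, t)
    return total


def x(t):
    """Assumed smooth scalar driver X_t = t."""
    return t


def canonical_area(s, t):
    """Canonical area of a smooth scalar path: (X_t - X_s)^2 / 2."""
    return 0.5 * (x(t) - x(s)) ** 2


def z(t):
    """Controlled path Z = X^2."""
    return x(t) ** 2


def z_prime(t):
    """Derivative process Z' = 2 X."""
    return 2.0 * x(t)


with_area = compensated_sum(z, z_prime, x, canonical_area, 100)
without_area = compensated_sum(z, z_prime, x, lambda s, t: 0.0, 100)
```

Both sums approximate $\int_0^1 t^2\,dt = \frac13$, but with 100 steps the compensated sum is off by about $3 \times 10^{-5}$ while the plain left-point sum is off by about $5 \times 10^{-3}$. For genuinely rough drivers the plain sum need not converge at all, which is why (2.6) includes the area term.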

For $(Y, Y') \in \mathcal{C}_X^\gamma$ and a $C^2$ function $\psi : \mathbb{R}^n \to \mathbb{R}^m$, we may define a new weakly
controlled rough path $(\psi(Y), \psi(Y)') \in \mathcal{C}_X^\gamma$ as

$$\psi(Y)_t = \psi(Y_t), \qquad \psi(Y)'_t = D\psi(Y_t) Y'_t. \tag{2.9}$$

Then we have the following bound from [Hai10, Lemma 2.2]:

Lemma 2.4 Let $(Y, Y') \in \mathcal{C}_X^\gamma$ and $(\psi(Y), \psi(Y)')$ be as defined in (2.9). Then we have

$$\|(\psi(Y), \psi(Y)')\|_{X,\gamma} \le M(1 + \|\psi\|_\infty + \|D^2\psi\|_\infty)(1 + \|(X, \mathbb{X})\|_\gamma)(1 + \|(Y, Y')\|_{X,\gamma})^2,$$

where the supremum norms of $\psi$ and $D^2\psi$ are taken over the ball of radius $\|Y\|_\infty$.

2.4 Notion of solution

With all of these notations at hand, we give the following definition of a solution to 
(1.1): 

Definition 2.5 Let $\gamma > \frac13$ and let $(X, \mathbb{X}) \in \mathcal{D}^\gamma$. Then, $Z \in \mathcal{C}^\gamma$ is a solution to
(1.1) if $(Z, Z') = (Z, V(Z)) \in \mathcal{C}_X^\gamma$ and the integral version of (1.1) holds, where the
composition of a controlled rough path with a nonlinear function is interpreted as in
(2.9) and the integral of a controlled rough path against $X$ is defined by (2.6). Here, we
denoted by $V$ the collection $(V_1, \dots, V_d)$.

A standard fixed point argument, as given for example in [Lyo98, Gub04], then 
yields: 

Theorem 2.6 For $V \in C^3$, there exists a unique local solution to (1.1).

From now on, we will always refer to this notion of a solution to (1.1), without 
further specifying it. 

3 A version of Norris's lemma 

One of the main ingredients of the proof of Hörmander's theorem using Malliavin
calculus is Norris's lemma, which is essentially a quantitative version of the Doob- 
Meyer decomposition theorem. Loosely speaking, it states that under certain additional 
regularity assumptions, if a semimartingale is "small", then both its bounded variation 
part and its martingale part have to be "small" separately. In other terms, if we have 
some a priori knowledge of the regularity of a semimartingale, then there is a limit to 
the amount of cancellations that can occur between the two terms in its Doob-Meyer 
decomposition. The intuitive reason for this is that a continuous martingale is nothing 
but a time-changed Brownian motion and so it has to be very rough at every single 
scale. 

Results of this type are usually considered to be the archetype of a probabilistic re- 
sult. The aim of this section is to argue that while the probabilistic intuition described 
above is certainly true in some sense, one can have a much more pathwise perspective 
on Norris's lemma. This was already apparent in [HM09], where the authors obtain a 
result that is similar in flavour to Norris's lemma, but where this lack of cancellations 
is formulated as a deterministic property that occurs on a universal "large" subset of 
Wiener space. Here, we take this viewpoint one step further by exhibiting a univer- 
sal set of probability 1 on which a quantitative version of Norris's lemma holds as a 
deterministic property. 

The main ingredient in our pathwise perspective is the following definition that 
makes precise what we mean by a path that is "rough at every scale": 

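Definition 3.1 A path $X_t$ with values in $\mathbb{R}^n$ is said to be $\theta$-Hölder rough in the interval
$[0, T]$ for $\theta \in (0, 1)$, if there exists a constant $L_\theta(X)$ such that for every $s \in [0, T]$,
every $\varepsilon \in (0, T/2]$, and every $\varphi \in \mathbb{R}^n$ with $\|\varphi\| = 1$, there exists $t \in [0, T]$ such that

$$\varepsilon/2 < |t - s| < \varepsilon, \quad\text{and}\quad |\langle \varphi, \delta X_{s,t} \rangle| > L_\theta(X)\, \varepsilon^\theta. \tag{3.1}$$

We denote the largest such $L_\theta$ the "modulus of $\theta$-Hölder roughness of $X$".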
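
The quantifiers in Definition 3.1 can be mimicked on a finite grid. The following sketch is a rough numerical proxy only, written for a scalar path (so that the unit vectors are just $\pm 1$) and for a finite, user-chosen list of scales. The function name, the grid and the chosen scales are illustrative assumptions, and the returned number is not the modulus $L_\theta(X)$ itself, which involves all base points and all scales.

```python
def roughness_proxy(values, theta, window_sizes, horizon=1.0):
    """Grid version of the infimum in Definition 3.1 for a scalar path.

    values: samples X_{k h}, k = 0..n, with h = horizon / n.
    window_sizes: integers m, each standing for the scale eps = m * h.
    """
    n = len(values) - 1
    h = horizon / n
    worst = float("inf")
    for m in window_sizes:
        eps = m * h
        for i in range(n + 1):
            # Grid times t with eps/2 < |t - s| < eps, in integer arithmetic.
            partners = [j for j in range(n + 1) if m < 2 * abs(j - i) < 2 * m]
            if not partners:
                continue  # scale too fine for this grid, nothing to test
            best = max(abs(values[j] - values[i]) for j in partners)
            worst = min(worst, best / eps ** theta)
    return worst


CONSTANT = [1.0] * 65
LINEAR = [k / 64 for k in range(65)]
flat = roughness_proxy(CONSTANT, 0.5, [8, 16, 32])
ramp = roughness_proxy(LINEAR, 0.5, [8, 16, 32])
```

A constant path gives the value $0$, as expected. The linear path gives a positive number on these three scales, but the ratio behaves like $\varepsilon^{1-\theta}$ and therefore tends to $0$ as finer scales are included, which reflects the fact that smooth paths are not Hölder rough.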

Remark 3.2 We emphasize that the choice of quantifiers in the above definition ensures
that such Hölder rough paths actually do exist. In particular, as soon as $n \ge 2$,
it is essential to allow the precise location of $t$ such that (3.1) holds to depend on the
vector $\varphi$.

A first rather straightforward consequence of this definition is that if a rough path
$(X, \mathbb{X})$ happens to be Hölder rough, then the "derivative process" $Z'$ in the decomposition
(2.5) of a controlled rough path is uniquely determined by $Z$. This can be made
quantitative in the following way:

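Proposition 3.3 Let $(X, \mathbb{X}) \in \mathcal{D}^\gamma([0, T], \mathbb{R}^n)$ be such that $X$ is a $\theta$-Hölder rough path.
Then there exists a constant $M$ depending only on $n$ and $m$ such that the bound

$$\|Z'\|_\infty \le \frac{M \|Z\|_\infty}{L_\theta(X)} \Big(\|R^Z\|_{2\gamma}^{\theta/2\gamma}\, \|Z\|_\infty^{-\theta/2\gamma} \vee T^{-\theta}\Big),$$

holds for every controlled rough path $(Z, Z') \in \mathcal{C}_X^\gamma([0, T], \mathbb{R}^m)$.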

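Proof. Fix $s \in [0, T]$ and $\varepsilon \in (0, T/2]$. From the definition of the remainder $R^Z$ in
(2.5), it then follows that

$$\sup_{|t-s| \le \varepsilon} |Z'_s\, \delta X_{s,t}| \le \sup_{|t-s| \le \varepsilon} \big(|\delta Z_{s,t}| + |R^Z_{s,t}|\big) \le 2\|Z\|_\infty + \|R^Z\|_{2\gamma}\, \varepsilon^{2\gamma}. \tag{3.2}$$

Let now $Z'_s(j)$ denote the $j$th row of the matrix $Z'_s$. Since $X$ is $\theta$-Hölder rough by
assumption, for every $j \le d$, there exists $v = v(j)$ with $\varepsilon/2 < |v - s| < \varepsilon$ such that

$$|\langle Z'_s(j), \delta X_{s,v} \rangle| > L_\theta(X)\, \varepsilon^\theta\, |Z'_s(j)|. \tag{3.3}$$

Combining both (3.2) and (3.3), we thus obtain that

$$L_\theta(X)\, \varepsilon^\theta\, |Z'_s(j)| \le 2\|Z\|_\infty + \|R^Z\|_{2\gamma}\, \varepsilon^{2\gamma}.$$

Summing over the rows of $Z'_s$ yields a universal constant $C$ such that

$$L_\theta(X)\, \varepsilon^\theta\, |Z'_s| \le C\big(\|Z\|_\infty + \|R^Z\|_{2\gamma}\, \varepsilon^{2\gamma}\big).$$

Optimising over $\varepsilon$, we choose $\varepsilon = \|Z\|_\infty^{1/2\gamma}\, \|R^Z\|_{2\gamma}^{-1/2\gamma} \wedge (T/2)$, thus deducing that

$$|Z'_s| \le \frac{M \|Z\|_\infty}{L_\theta(X)} \Big(\|R^Z\|_{2\gamma}^{\theta/2\gamma}\, \|Z\|_\infty^{-\theta/2\gamma} \vee T^{-\theta}\Big).$$

Since $s$ was arbitrary, the stated bound follows at once. □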

One way of reading Proposition 3.3 is to say that if $\|Z\|_\infty$ is small, then $\|Z'\|_\infty$
must also be small, provided that $(Z, Z') \in \mathcal{C}_X^\gamma([0, T], \mathbb{R}^m)$ and that $X$ is Hölder rough.
In the following theorem, we apply Proposition 3.3 to obtain a quantitative version of
a "Doob-Meyer type decomposition" for SDEs driven by a rough path $X$. This is the
main new technical result of this article.

Theorem 3.4 Let $(X, \mathbb{X}) \in \mathcal{D}^\gamma([0, T], \mathbb{R}^n)$ with $\gamma > \frac13$ be such that $X$ is $\theta$-Hölder
rough with $2\gamma > \theta$. Let $(A, A') \in \mathcal{C}_X^\gamma([0, T], \mathbb{R}^m)$ and $B \in \mathcal{C}^\gamma([0, T], \mathbb{R}^m)$, and set

$$Z_t = \int_0^t A_s\,dX_s + \int_0^t B_s\,ds. \tag{3.4}$$

Then, there exist constants $r > 0$ and $q > 0$ such that, setting

$$\mathcal{R} := 1 + L_\theta(X)^{-1} + \|(X, \mathbb{X})\|_\gamma + \|(A, A')\|_{X,\gamma} + \|B\|_{\mathcal{C}^\gamma},$$

one has the bound

$$\|A\|_\infty + \|B\|_\infty \le M \mathcal{R}^q \|Z\|_\infty^r,$$

for a constant $M$ depending only on $T$, $m$ and $n$.

Remark 3.5 The proof provides the explicit value $q = 6$ and shows that $r$ can be taken
arbitrarily close to $(2\gamma - \theta)^2 (3\gamma - 1)/(4\gamma^2 (1 + \gamma))$, but these values are certainly not
optimal.

Proof. Note first that the definition of $Z$ does not change if we add a constant to $X$.
We will therefore assume without loss of generality that $X_0 = 0$, so that $\|X\|_\infty \le
T^\gamma \|X\|_\gamma \le T^\gamma \mathcal{R}$. By Theorem 2.2 we deduce that the pair $(Z, A)$ is a weakly controlled
rough path, $(Z, A) \in \mathcal{C}_X^\gamma[0, T]$, with

$$\delta Z = A\, \delta X + R^Z \tag{3.5}$$

and

$$\|R^Z\|_{2\gamma} \le M\big(\|X\|_\gamma \|R^A\|_{2\gamma} + \|\mathbb{X}\|_{2\gamma} \|A'\|_{\mathcal{C}^\gamma} + \|B\|_{\mathcal{C}^\gamma}\big) \le M \mathcal{R}^2.$$

We deduce from the above that in particular, we have the a priori bound $\|Z\|_\infty \le
M \mathcal{R}^2$.

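It then follows from Proposition 3.3 that

$$\|A\|_\infty \le M L_\theta(X)^{-1} \|Z\|_\infty^{1 - \theta/2\gamma} \big(\|R^Z\|_{2\gamma}^{\theta/2\gamma} + \|Z\|_\infty^{\theta/2\gamma}\big) \le M \mathcal{R}^3 \|Z\|_\infty^{1 - \theta/2\gamma}. \tag{3.6}$$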



This is already the requested bound on A. The bound on B is slightly more difficult to 
obtain. 

At this stage, we would like to make use of the information that $\|A\|_\infty$ is "small"
to get a bound on the integral of $A$ against $X$. In order to do so, it turns out to be
convenient to choose a $\beta \in (1/3, \gamma)$ with $2\beta > \theta$, so that we can interpret $(X, \mathbb{X})$
as an element of $\mathcal{D}^\beta([0, T], \mathbb{R}^n)$ with $\|(X, \mathbb{X})\|_\beta \le M \|(X, \mathbb{X})\|_\gamma$. This will allow us
to make use of interpolation inequalities to combine our a priori knowledge about the
boundedness of $(A, A')$ in $\mathcal{C}_X^\gamma$ norm with (3.6) to conclude that $(A, A')$ is small in $\mathcal{C}_X^\beta$.

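We first obtain a bound on $A'$. Since $(A, A') \in \mathcal{C}_X^\gamma([0, T], \mathbb{R}^m)$, we infer from (3.6)
and Proposition 3.3 that

$$\|A'\|_\infty \le M L_\theta(X)^{-1} \|R^A\|_{2\gamma}^{\theta/2\gamma} \|A\|_\infty^{1 - \theta/2\gamma} \le M \mathcal{R}^4 \|Z\|_\infty^{(1 - \theta/2\gamma)^2}.$$

Using the inequality

$$\|A'\|_\beta \le 2 \|A'\|_\gamma^{\beta/\gamma} \|A'\|_\infty^{1 - \beta/\gamma}, \tag{3.7}$$

which follows immediately from the definition of the Hölder norm, we obtain the bound

$$\|A'\|_{\mathcal{C}^\beta} \le M \|A'\|_\gamma^{\beta/\gamma} \|A'\|_\infty^{1 - \beta/\gamma} \le M \mathcal{R}^4 \|Z\|_\infty^{(1 - \theta/2\gamma)^2 (1 - \beta/\gamma)},$$

where we used the fact that $\beta < \gamma$. Similarly, we would like to obtain a bound on
$\|R^A\|_{2\beta}$. Combining the definition of $R^A$ with (3.6), we deduce that

$$\|R^A\|_\infty \le 2\big(\|A\|_\infty + \|A'\|_\infty \|X\|_\infty\big) \le M \mathcal{R}^5 \|Z\|_\infty^{(1 - \theta/2\gamma)^2}.$$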
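Using the obvious equivalent to (3.7), we conclude that

$$\|R^A\|_{2\beta} \le M \|R^A\|_{2\gamma}^{\beta/\gamma} \|R^A\|_\infty^{1 - \beta/\gamma} \le M \mathcal{R}^5 \|Z\|_\infty^{(1 - \theta/2\gamma)^2 (1 - \beta/\gamma)}.$$

We are now in a position to use Theorem 2.2 to bound the integral $\int_0^\cdot A_s\,dX_s$. Indeed,
we obtain from (2.7) the bound

$$\Big\|\int_0^\cdot A_s\,dX_s\Big\|_\infty \le M\big(|A_0| \|X\|_\infty + \|X\|_\beta \|R^A\|_{2\beta} + \|\mathbb{X}\|_{2\beta} \|A'\|_{\mathcal{C}^\beta}\big).$$

Inserting into this bound all of the above estimates, we conclude that

$$\Big\|\int_0^\cdot A_s\,dX_s\Big\|_\infty \le M \mathcal{R}^6 \|Z\|_\infty^{(1 - \theta/2\gamma)^2 (1 - \beta/\gamma)}. \tag{3.8}$$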



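This estimate, together with the definition (3.4) of $Z$ immediately implies that we also
have the bound

$$\Big\|\int_0^\cdot B_s\,ds\Big\|_\infty \le M \mathcal{R}^6 \|Z\|_\infty^{(1 - \theta/2\gamma)^2 (1 - \beta/\gamma)}.$$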

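Once again we use an interpolation inequality to strengthen this bound. Applying the
interpolation inequality

$$\|\partial_t f\|_\infty \le M \|f\|_\infty \max\Big(\frac{1}{T},\; \|f\|_\infty^{-\frac{1}{\gamma+1}} \|\partial_t f\|_\gamma^{\frac{1}{\gamma+1}}\Big)$$

(see [HM09, Lemma 6.14]) with $f(t) = \int_0^t B_s\,ds$, it follows that

$$\|B\|_\infty \le M \mathcal{R}^6 \|Z\|_\infty^{(1 - \theta/2\gamma)^2 (1 - \beta/\gamma) \frac{\gamma}{1+\gamma}}. \tag{3.9}$$

The claim now follows from (3.6) and (3.9), and the remark following the statement
follows by choosing $\beta \approx \frac13$. □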

3.1 Hölder roughness of sample paths of fBm

Our aim now is to show that the sample paths of fractional Brownian motion are indeed
almost surely Hölder rough and to provide quantitative bounds on the tail behaviour of
$L_\theta(X)$ for a suitable $\theta$.

Let $\{\mathcal{F}_s, s \in \mathbb{R}\}$ be the natural filtration generated by the fBm $\{X_s\}_{s \in \mathbb{R}}$. We start
with the following lemma on the small ball probability of the conditioned fBm:

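Lemma 3.6 Let $\varphi \in \mathbb{R}^n$ with $\|\varphi\| = 1$ and $\delta < 1$. Then, there exist constants $M$ and
$c$ such that the bound

$$\mathbb{P}\Big(\inf_{\|\varphi\| = 1}\, \sup_{s,t \in [0,\delta]} |\langle \varphi, \delta X_{st} \rangle| \le \varepsilon \,\Big|\, \mathcal{F}_0\Big) \le M e^{-c\, \delta^{2H} \varepsilon^{-2}} \tag{3.10}$$

holds for every $0 < \varepsilon < 1$.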



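Proof. For the moment, let us fix an arbitrary $\varphi$ with $\|\varphi\| = 1$. By the triangle inequality
we obtain

$$\mathbb{P}\Big(\sup_{s,t \in [0,\delta]} |\langle \varphi, \delta X_{st} \rangle| \le 2\varepsilon \,\Big|\, \mathcal{F}_0\Big) \le \mathbb{P}\Big(\sup_{t \in [0,\delta]} |\langle \varphi, X_t \rangle| \le \varepsilon \,\Big|\, \mathcal{F}_0\Big).$$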

At this stage, we note that there exists a one-dimensional Wiener process $W$ (depending
on $\varphi$) independent of $\mathcal{F}_0$, a stochastic process $Y$ such that $Y_t$ is $\mathcal{F}_0$-measurable for
every $t \ge 0$, and a constant $c$ such that

$$\langle \varphi, X_t \rangle = Y_t + c \int_0^t (t - s)^{H - \frac12}\,dW(s) := Y_t + \tilde{X}_t. \tag{3.11}$$

(See for example [MVN68, Hai05].) For the rest of the proof, we use $c$ for a generic
universal constant that will change from expression to expression.

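It then follows from Anderson's inequality ([And55]) that we have the bound

$$\mathbb{P}\Big(\sup_{t \in [0,\delta]} |\langle \varphi, X_t \rangle| \le \varepsilon \,\Big|\, \mathcal{F}_0\Big) = \mathbb{P}\Big(\sup_{t \in [0,\delta]} |\tilde{X}_t + Y_t| \le \varepsilon \,\Big|\, \mathcal{F}_0\Big) \le \mathbb{P}\Big(\sup_{t \in [0,\delta]} |\tilde{X}_t| \le \varepsilon\Big).$$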

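On the other hand, we can invert the expression (3.11) for $\tilde{X}$, yielding

$$W_t = c \int_0^t (t - s)^{\frac12 - H}\,d\tilde{X}_s = c \int_0^t (t - s)^{-\frac12 - H} \tilde{X}_s\,ds.$$

In particular, provided that $H < \frac12$, we have the bound

$$\sup_{t \in [0,\delta]} |W_t| \le c\, \delta^{\frac12 - H} \sup_{t \in [0,\delta]} |\tilde{X}_t|,$$

so that

$$\mathbb{P}\Big(\sup_{t \in [0,\delta]} |\langle \varphi, X_t \rangle| \le \varepsilon \,\Big|\, \mathcal{F}_0\Big) \le \mathbb{P}\Big(\sup_{t \in [0,\delta]} |W_t| \le c\, \delta^{\frac12 - H} \varepsilon\Big) \le M e^{-c\, \delta^{2H} \varepsilon^{-2}},$$

where the last inequality is the well known small ball probability for the standard Brownian
motion [LS01].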

Up to now, the calculation was performed with a fixed instance of $\varphi$. The conclusion
then follows by a standard covering argument, see for example [Nor86, p. 127].

□ 

We have the following corollary of Lemma 3.6: 

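Corollary 3.7 For any interval $I_\delta = [u_1, u_1 + \delta] \subset \mathbb{R}$ of length $\delta$ and any $u \le u_1$,
there exist constants $M$ and $c$ such that the bound

$$\mathbb{P}\Big(\inf_{\|\varphi\| = 1}\, \sup_{s,t \in I_\delta} |\langle \varphi, \delta X_{st} \rangle| \le \varepsilon \,\Big|\, \mathcal{F}_u\Big) \le M e^{-c\, \delta^{2H} \varepsilon^{-2}}$$

holds for every $0 < \varepsilon < 1$.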

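Proof. Define the event $G = \{\inf_{\|\varphi\| = 1} \sup_{s,t \in I_\delta} |\langle \varphi, \delta X_{st} \rangle| \le \varepsilon\}$. Since the increments
of the fBm are stationary, by Lemma 3.6 we obtain the bound

$$\mathbb{E}\big(1_G \,\big|\, \mathcal{F}_{u_1}\big) \le M e^{-c\, \delta^{2H} \varepsilon^{-2}}. \tag{3.12}$$

Now notice that for any $G \in \mathcal{F}$ and $u \le u_1$, $\mathbb{E}(1_G \,|\, \mathcal{F}_u) = \mathbb{E}\big(\mathbb{E}(1_G \,|\, \mathcal{F}_{u_1}) \,\big|\, \mathcal{F}_u\big)$. Since
the right hand side of Equation (3.12) does not depend on $u_1$, it immediately follows
that

$$\mathbb{P}\Big(\inf_{\|\varphi\| = 1}\, \sup_{s,t \in I_\delta} |\langle \varphi, \delta X_{st} \rangle| \le \varepsilon \,\Big|\, \mathcal{F}_u\Big) = \mathbb{E}\big(1_G \,\big|\, \mathcal{F}_u\big) \le M e^{-c\, \delta^{2H} \varepsilon^{-2}} \tag{3.13}$$

and the corollary follows. □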

Remark 3.8 Although the above proof requires that $H < \frac12$, one would expect that a
result similar to that of Lemma 3.6 holds for all Gaussian processes with covariance
function $C(\cdot, \cdot)$ satisfying the bounds

$$M_\ell |t - s|^{2H} \le C(s, t) \le M_u |t - s|^{2H}, \qquad s, t \in [0, T],$$

for some constants $M_\ell$, $M_u$, even if $H \ge \frac12$. For instance, a result similar to Lemma
3.6 can be shown to hold in the case of fBm with index $H > \frac12$; see for example [BH07,
Proposition 3.4].

Using this estimate we are now in the position to obtain bounds on the modulus of
Hölder roughness for fractional Brownian motion with $H < \frac12$ by a type of chaining
argument.

Lemma 3.9 Let $X$ be a fBm with Hurst parameter $H < \frac12$. Then, for every $\theta > H$, the
sample paths of $X$ are almost surely $\theta$-Hölder rough. Moreover, there exist constants
$M$ and $c$ such that

$$\mathbb{P}(L_\theta(X) < \varepsilon \,|\, \mathcal{F}_0) \le M \exp(-c\, \varepsilon^{-2}),$$

for all $\varepsilon \in (0, 1)$. In particular, $\mathbb{E}(L_\theta^{-p}(X) \,|\, \mathcal{F}_0) < \infty$ for every $p > 0$.
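
Proof. A different way of formulating Definition 3.1 is given by

$$L_\theta(X) = \inf_{\|\varphi\| = 1}\, \inf_{t \in [0,T]}\, \inf_{r \in (0,T/2]}\, \sup_{r/2 < |t-s| < r} \frac{|\langle \varphi, \delta X_{s,t} \rangle|}{r^\theta}.$$

We then define the "discrete analog" $D_\theta(X)$ of $L_\theta(X)$ to be

$$D_\theta(X) = \inf_{\|\varphi\| = 1}\, \inf_{n \ge 1}\, \inf_{k \le 2^n}\, \sup_{s,t \in I_{k,n}} \frac{|\langle \varphi, \delta X_{st} \rangle|}{(2^{-n} T)^\theta},$$

where $I_{k,n} = [\frac{k-1}{2^n} T, \frac{k}{2^n} T]$. We first claim that

$$L_\theta(X) \ge \frac{1}{2 \cdot 8^\theta} D_\theta(X). \tag{3.14}$$

Indeed, given $t \in [0, T]$ and $r \in (0, T/2]$, pick $n \in \mathbb{N}$ such that $r/8 \le 2^{-n} T < r/4$.
It follows that there exists some $k$ such that $I_{k,n}$ is included in the set $\{s : r/2 <
|t - s| < r\}$. Then, by definition of $D_\theta$, for any unit vector $\varphi$ there exist two points
$t_1, t_2 \in I_{k,n}$ such that

$$|\langle \varphi, \delta X_{t_1 t_2} \rangle| \ge (2^{-n} T)^\theta D_\theta(X).$$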
Therefore by the triangle inequality, we conclude that the magnitude of the difference
between $\langle \varphi, X_t \rangle$ and one of the two terms $\langle \varphi, X_{t_i} \rangle$, $i = 1, 2$ (say $t_1$) is at least

$$|\langle \varphi, \delta X_{t_1 t} \rangle| \ge \frac12 (2^{-n} T)^\theta D_\theta(X)$$

and therefore

$$\frac{|\langle \varphi, \delta X_{t_1 t} \rangle|}{r^\theta} \ge \frac12\, \frac{(2^{-n} T)^\theta}{r^\theta}\, D_\theta(X) \ge \frac{1}{2 \cdot 8^\theta}\, D_\theta(X).$$

Since $t$, $r$ and $\varphi$ were chosen arbitrarily, the claim (3.14) follows.

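It follows that it is sufficient to obtain the requested bound on $\mathbb{P}(D_\theta(X) < \varepsilon \,|\, \mathcal{F}_0)$.
We have the straightforward bound

$$\mathbb{P}(D_\theta(X) < \varepsilon \,|\, \mathcal{F}_0) \le \sum_{n=1}^\infty \sum_{k=1}^{2^n} \mathbb{P}\Big(\inf_{\|\varphi\| = 1}\, \sup_{s,t \in I_{k,n}} \frac{|\langle \varphi, \delta X_{st} \rangle|}{(2^{-n} T)^\theta} < \varepsilon \,\Big|\, \mathcal{F}_0\Big).$$

Applying Lemma 3.6 and noting that the bound obtained in this way is independent of
$k$, we conclude that

$$\mathbb{P}(D_\theta(X) < \varepsilon \,|\, \mathcal{F}_0) \le M \sum_{n=1}^\infty 2^n \exp\big(-c\, 2^{2n(\theta - H)} \varepsilon^{-2}\big) \le \tilde{M} \sum_{n=1}^\infty \exp(-\tilde{c}\, n\, \varepsilon^{-2}).$$

Here, we used the fact that we can find constants $K$ and $\tilde{c}$ such that

$$n \log 2 - c\, 2^{2n(\theta - H)} \varepsilon^{-2} \le K - \tilde{c}\, n\, \varepsilon^{-2},$$

uniformly over all $\varepsilon < 1$ and all $n \ge 1$. We deduce from this the bound

$$\mathbb{P}(D_\theta(X) < \varepsilon \,|\, \mathcal{F}_0) \le \tilde{M} \Big(e^{-\tilde{c}\, \varepsilon^{-2}} + \int_1^\infty \exp(-\tilde{c}\, \varepsilon^{-2} x)\,dx\Big),$$

which immediately implies the result. □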



4 Malliavin derivatives 

In this section, we derive formulae for the Malliavin derivatives of solutions to (1.1), 
when conditioned on the past of the driving noise. In order to clarify the meaning of 
this statement, we will reduce this conditioned solution to a functional of an underlying 
Wiener process. With this notation, the Malliavin derivative will simply be the "usual" 
Malliavin derivative of a random variable on Wiener space. 

Before proceeding further, let us make a digression that clarifies this construction.
For $\alpha \in (0, 1)$, we define the fractional integration operator $\mathcal{I}^\alpha$ and the corresponding
fractional differentiation operator $\mathcal{D}^\alpha$ by

$$\mathcal{I}^\alpha f(t) = \frac{1}{\Gamma(\alpha)} \int_0^t (t - s)^{\alpha - 1} f(s)\,ds, \qquad \mathcal{D}^\alpha f(t) = \frac{1}{\Gamma(1 - \alpha)} \frac{d}{dt} \int_0^t (t - s)^{-\alpha} f(s)\,ds. \tag{4.1}$$

These operators are inverses of each other, see for example [SKM93] for a survey of
fractional integral operators.
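
As a quick sanity check on the fractional integration operator, the sketch below approximates the standard Riemann-Liouville integral of order $\alpha$ by freezing $f$ at the left endpoint of each cell and integrating the singular kernel $(t-s)^{\alpha-1}$ exactly over the cell. The discretisation, the function name and the choice $\alpha = 0.1$ (which would correspond to $\frac12 - H$ with $H = 0.4$) are assumptions made for illustration, not constructions taken from the article.

```python
import math


def fractional_integral(f, alpha, t, n_steps=200):
    """Riemann-Liouville integral of order alpha with f frozen on each cell.

    The kernel (t - s)^(alpha - 1) is integrated exactly over every cell,
    so the only error comes from replacing f by a piecewise constant function.
    """
    total = 0.0
    for k in range(n_steps):
        s0 = t * k / n_steps
        s1 = t * (k + 1) / n_steps
        # max(..., 0.0) guards against tiny negative values from rounding.
        left = max(t - s0, 0.0) ** alpha
        right = max(t - s1, 0.0) ** alpha
        total += (left - right) / alpha * f(s0)
    return total / math.gamma(alpha)


ALPHA = 0.1
value_const = fractional_integral(lambda s: 1.0, ALPHA, 1.0)
value_linear = fractional_integral(lambda s: s, ALPHA, 1.0, n_steps=2000)
```

For the constant function the scheme reproduces the closed form $t^\alpha / \Gamma(\alpha + 1)$ up to rounding, and for $f(s) = s$ it approaches $t^{1+\alpha} / \Gamma(\alpha + 2)$ as the grid is refined.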

It turns out that the operator $\mathcal{I}^{1/2-H}$ is an isometry between the Cameron-Martin
space of the conditioned fBm and that of the underlying Wiener process mentioned at
the beginning of this section. More precisely, given a typical instance $w_- \in C(\mathbb{R}_-, \mathbb{R}^d)$
of the "past" of the fBm, it follows from the Mandelbrot-Van Ness representation
of the fractional Brownian motion [MVN68, Hai05] that there exists a constant $\alpha_H$
and a (one-sided) Wiener process $W$ on $\mathbb{R}_+$ independent of $w_-$ such that the future
$w_+ \in C(\mathbb{R}_+, \mathbb{R}^d)$ of the fBm conditioned on the past $w_-$ may be expressed as

$$w_+ = \mathcal{G} w_- + \alpha_H \mathcal{D}^{1/2-H} W , \tag{4.2}$$

where $\mathcal{D}^{1/2-H}$ is as defined in (4.1) and the operator $\mathcal{G} : C(\mathbb{R}_-, \mathbb{R}^d) \to C(\mathbb{R}_+, \mathbb{R}^d)$ is
given by:

$$(\mathcal{G} w_-)(t) = \gamma_H \int_0^\infty \frac{1}{r}\, g\Big(\frac{t}{r}\Big)\, w_-(-r)\,dr . \tag{4.3}$$

Here, the kernel $g$ is given by

$$g(x) = x^{H-1/2} + (H - 3/2)\, x \int_0^1 \frac{(u+x)^{H-5/2}}{(1-u)^{H-1/2}}\,du , \tag{4.4}$$

and the constant $\gamma_H$ is given by $\gamma_H = (H - \frac12)\alpha_H \alpha_{1-H}$, where $\alpha_H$ is the constant appearing
in (4.2). The interpretation of the operator $\mathcal{G}$ is that $(\mathcal{G} w_-)(t)$ is the conditional
expectation at time $t$ of a two-sided fractional Brownian motion with Hurst parameter
$H$, conditioned on coinciding with $w_-$ for negative times.

Henceforth we will use the notation (4.2), namely we denote the past of the fBm by
$w_- \in C(\mathbb{R}_-, \mathbb{R}^d)$ and the future by $w_+ \in C(\mathbb{R}_+, \mathbb{R}^d)$. At this stage, we will use a slight
abuse of notation, and we will also sometimes interpret $w_+$ as an element in the space
$\mathcal{D}^\gamma_g(\mathbb{R}_+, \mathbb{R}^d)$ of geometric rough paths that are $\gamma$-Hölder continuous, although we then
usually denote it by $(X, \mathbb{X})$.

In view of (4.2), it will be useful to clarify how to interpret this when the future is
interpreted as an element in the space $\mathcal{D}^\gamma_g(\mathbb{R}_+, \mathbb{R}^d)$, and for which instances of $w_-$ the
decomposition (4.2) makes sense. Recall that, for $(X, \mathbb{X}) \in \mathcal{D}^\gamma_g([0, T], \mathbb{R}^d)$ and $h \in
C([0, T], \mathbb{R}^d)$ a path with bounded variation, we can define a translated path $(Y, \mathbb{Y}) =
\tau_h(X, \mathbb{X})$ in a natural way by

$$Y_t = X_t + h_t , \qquad \mathbb{Y}_{s,t} = \mathbb{X}_{s,t} + \int_s^t \delta X_{s,r} \otimes dh_r + \int_s^t \delta h_{s,r} \otimes dX_r + \int_s^t \delta h_{s,r} \otimes dh_r . \tag{4.5}$$

Since we assumed $h$ to be of bounded variation, the integrals appearing in this expression
should be interpreted as usual Riemann-Stieltjes integrals. Assume furthermore
that $h$ is such that there exists a constant $\|h\|_{1,\gamma}$ such that, for every $s < t$ in $[0, T]$,
the variation of $h$ over the interval $[s, t]$ is bounded by $\|h\|_{1,\gamma} |t - s|^\gamma$. In this case, it
follows immediately that there exists $M$ (depending on $T$) such that

$$\|Y - X\|_\gamma \le \|h\|_{1,\gamma} , \qquad \|\mathbb{Y} - \mathbb{X}\|_{2\gamma} \le M \|h\|_{1,\gamma} \big( \|h\|_{1,\gamma} + \|X\|_\gamma \big) . \tag{4.6}$$

Similarly, we check that there exists a constant $M$ such that

$$\|\tau_h(X, \mathbb{X}) - \tau_h(\tilde X, \tilde{\mathbb{X}})\|_\gamma \le M \|h\|_{1,\gamma}\, \|(X, \mathbb{X}) - (\tilde X, \tilde{\mathbb{X}})\|_\gamma , \tag{4.7}$$

so that $\tau_h$ is Lipschitz continuous as a map from $\mathcal{D}^\gamma([0, T], \mathbb{R}^d)$ to itself.




Denote now by $\mathcal{W}^\gamma$ the completion of $C_0^\infty(\mathbb{R}_-; \mathbb{R}^d)$ with respect to the norm



\uj(t) — u)(s)\ , , „ N 

wL = sup !— . (4.8) 

For $H \in (1/3, 1)$ and $\gamma \in (1/3, H)$, it can be shown that there exists a probability
measure $\mathbf{P}_-$ on $\mathcal{W}^\gamma$ such that the canonical process associated to $\mathbf{P}_-$ is a fractional
Brownian motion with Hurst parameter $H$ [Hai05].

Notice now that the operator $\mathcal{G}$ given by (4.3) is actually defined on all of $\mathcal{W}^\gamma$.
Indeed, similarly to [HO07, Prop. A.2], it can be checked that the kernel $g$ defined in
Equation (4.4) is smooth away from $0$ and that its derivative satisfies

$$g'(t) = O(1) , \quad t \ll 1 , \qquad g'(t) = O(t^{H-3/2}) , \quad t \gg 1 . \tag{4.9}$$

It follows that, for $w_- \in \mathcal{W}^\gamma$, one has the bound

\{Gw-)Xt)\<C\w-li<f- x + V-*) t 

where $|||\cdot|||_\gamma$ is as in (4.8). In particular, over every finite time interval there exists a
constant $M$ such that

$$\|\mathcal{G} w_-\|_{1,\gamma} \le M\, |||w_-|||_\gamma . \tag{4.10}$$

As a consequence of this discussion, (4.2) makes sense in $\mathcal{D}^\gamma(\mathbb{R}_+, \mathbb{R}^d)$ for every $w_- \in
\mathcal{W}^\gamma$.

4.1 Derivatives of the solutions 

We now derive expressions for the derivatives of solutions to (1.1), both with respect 
to its initial condition and with respect to the driving noise. For this, we make the 
following assumption, which will be enforced throughout the whole article: 

Assumption 4.1 The vector fields $V_i$ are $C^\infty$ and all of their derivatives grow at most
polynomially fast. Furthermore, for every initial condition $z \in \mathbb{R}^n$, every final time $T$,
and every $(X, \mathbb{X}) \in \mathcal{D}^\gamma_g([0, T], \mathbb{R}^d)$ with $\gamma > \frac13$, (1.1) has a solution up to time $T$.

Remark 4.2 Since the solution to (1.1) depends continuously on both its initial condition
and the rough path $(X, \mathbb{X})$ [FV10b], it follows from a simple compactness argument
that, for every $R > 0$ and every final time $T > 0$, there exists a constant $M$ such
that the bound

$$\|(Z, Z')\|_{X,\gamma} \le M ,$$

holds uniformly over all initial conditions $|z| \le R$ and all driving noises $\|(X, \mathbb{X})\|_\gamma \le
R$. Here, we use the fact that, over finite time intervals, the embedding $\mathcal{D}^\beta \hookrightarrow \mathcal{D}^\gamma$ is
compact for $\beta > \gamma$ [FV10b, Prop. 8.17] and that the continuous dependence on the
driving path also holds in $\mathcal{D}^\beta$.

For an initial condition $z$ and an instance of the driving noise $w = (w_-, w_+)$, let
$\Phi_t(z, w_+)$ denote the solution map of (1.1), so that $Z_t = \Phi_t(z, w_+)$.
Note that for defining the solution, we only use $w_+$ and do not use $w_-$, the past of the
driving noise. Define the Jacobian

$$J_{0,t} \stackrel{\text{def}}{=} \frac{\partial \Phi_t(z, w_+)}{\partial z} ,$$

and, for notational convenience, set $V = (V_1, V_2, \ldots, V_d)$. Then the Jacobian $J_{0,t}$ and
its inverse satisfy the evolution equations

$$dJ_{0,t} = DV_0(Z_t)\, J_{0,t}\, dt + DV(Z_t)\, J_{0,t}\, dX_t , \tag{4.11a}$$
$$dJ_{0,t}^{-1} = -J_{0,t}^{-1}\, DV_0(Z_t)\, dt - J_{0,t}^{-1}\, DV(Z_t)\, dX_t . \tag{4.11b}$$

Here, both $J$ and $J^{-1}$ are $n \times n$ matrices, and $J_{0,0} = J_{0,0}^{-1} = I$. In order to deduce
(4.11b) from (4.11a), we used the chain rule, which holds provided that $(X, \mathbb{X}) \in \mathcal{D}^\gamma_g$.
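The relation between the Jacobian equation and the equation for its inverse can be illustrated numerically in the much simpler setting of a smooth driving path, where the rough integrals reduce to ordinary ones. The sketch below is only an illustration under that assumption: the vector field $V(z) = (z_2, -\sin z_1)$, the driver $x(t) = \sin(3t)$ and the Euler discretisation are all hypothetical choices, not taken from the text.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def jacobian_and_inverse(z0=(0.3, -0.2), T=1.0, n=20000):
    """Euler scheme for dz = V(z) dx with the smooth driver x(t) = sin(3t),
    together with the linearised equations
        dJ = DV(z) J dx,    dK = -K DV(z) dx,
    where K plays the role of the inverse Jacobian.
    V(z) = (z2, -sin z1) is a hypothetical vector field used for illustration.
    """
    h = T / n
    z = list(z0)
    J = [[1.0, 0.0], [0.0, 1.0]]
    K = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(n):
        dx = math.sin(3 * (k + 1) * h) - math.sin(3 * k * h)
        DV = [[0.0, 1.0], [-math.cos(z[0]), 0.0]]   # derivative of V at the current z
        DVJ = mat_mul(DV, J)
        KDV = mat_mul(K, DV)
        J = [[J[i][j] + DVJ[i][j] * dx for j in range(2)] for i in range(2)]
        K = [[K[i][j] - KDV[i][j] * dx for j in range(2)] for i in range(2)]
        z = [z[0] + z[1] * dx, z[1] - math.sin(z[0]) * dx]
    return J, K

def defect(J, K):
    """Largest entry of K J - I; small when K approximates the inverse of J."""
    P = mat_mul(K, J)
    return max(abs(P[i][j] - (1.0 if i == j else 0.0))
               for i in range(2) for j in range(2))
```

With these choices, `defect(*jacobian_and_inverse())` is small and shrinks as `n` grows, which reflects the fact that the two linear equations are designed so that the product of their solutions stays equal to the identity.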

We now consider the effect on the solution of a variation, not of the initial condition,
but of the driving noise itself. For this, we define the operators $\mathcal{A}_T : L^2([0, T], \mathbb{R}^d) \to
\mathbb{R}^n$ by

$$\mathcal{A}_T v = \int_0^T J_{0,s}^{-1}\, V(Z_s)\, v(s)\,ds . \tag{4.12}$$

A particular role will be played by $\mathcal{A}_T^* : \mathbb{R}^n \to L^2([0, T], \mathbb{R}^d)$, the adjoint of $\mathcal{A}_T$,
which is given by:

$$(\mathcal{A}_T^* \xi)(s) = V(Z_s)^* (J_{0,s}^{-1})^* \xi . \tag{4.13}$$

It is known [FV10b] that for every sample path $w_+$ of fBm in $\mathcal{D}^\gamma_g$ and for any fixed $T$,
the map$^1$

$$h \in \mathcal{H}_{H,+} \mapsto \Phi_T(z, w_+ + h) \tag{4.14}$$

is Fréchet differentiable, where $\mathcal{H}_{H,+}$ denotes the Cameron-Martin space of the Gaussian
process $w_+$. Furthermore, setting $h(s) = \int_0^s v(r)\,dr$ for some $v$, one has the
identity

$$D_h \Phi_T(z, w_+ + h)\big|_{h=0} = J_{0,T}\, \mathcal{A}_T v , \tag{4.15}$$

whenever $v \in L^2$. Note that, by (4.2), the space $\mathcal{H}_{H,+}$ consists of those paths $h$ such
that $h = \mathcal{D}^{1/2-H} \tilde h$ for some $\tilde h$ in the Cameron-Martin space of $W$, which in turn is equal
to $H^1$, the space of square integrable functions with square integrable weak derivative.
This shows that the corresponding element $v$ does not necessarily belong to $L^2$, so that
one may wonder what the meaning of (4.15) is in general. Writing $F_s = J_{0,s}^{-1} V(Z_s)$
as a shorthand, a calculation shows that there is a constant $c$ such that

$$\mathcal{A}_T \mathcal{D}^{1/2-H} v = c \int_0^T \Big( \int_s^T (r-s)^{H-3/2} (F_s - F_r)\,dr + \frac{2}{1-2H}\,(T-s)^{H-1/2} F_s \Big) v(s)\,ds , \tag{4.16}$$

so that $|\mathcal{A}_T \mathcal{D}^{1/2-H} v| \le M \|F\|_{\mathcal{C}^\gamma} \|v\|_{L^2}$ for some constant $M$, provided that $\gamma > \frac12 - H$.
Since, in our particular case, $F$ is a rough path controlled by $(X, \mathbb{X})$, the condition
$\|F\|_{\mathcal{C}^\gamma} < \infty$ for $\gamma > \frac12 - H$ can always be satisfied when $\frac12 - H < H$, namely when
$H > \frac14$.

In the sequel, we will write $D_v Z_T$ as a shorthand for the derivative of the solution
map in the direction $h = \int_0^\cdot v(s)\,ds$, i.e.

$$D_v Z_T = D_h \Phi_T(z, w_+) = J_{0,T}\, \mathcal{A}_T v . \tag{4.17}$$

$^1$Since $\mathcal{D}^\gamma_g$ is not a linear space, the "addition" of the paths $w_+$ and $h$ should be interpreted in the sense
of (4.5).

We also set

$$D_s Z^z_t = J_{s,t}\, V(Z^z_s) = J_{0,t}\, J_{0,s}^{-1}\, V(Z^z_s) , \tag{4.18}$$

so that $D_v Z^z_t$ is the $L^2$-scalar product of $D Z^z_t$ with $v$.

4.2 Malliavin differentiability of the solutions

Using the representation (4.2), the solution map $\Phi_t(z, w_+)$ conditioned on the past $w_-$
of the driving noise may be viewed as a functional of an underlying Wiener process
on $[0, \infty)$, which then allows us to define the Malliavin derivative of the solution map
$\mathscr{D} Z^z_t = \mathscr{D} \Phi_t(z, w_+)$ in the usual way. As shown in [CFV09], the Malliavin derivative
is related to the Fréchet derivative given by (4.14) in the following way. For any
$\int_0^\cdot v(s)\,ds \in \mathcal{H}_{H,+}$, define $\tilde v = \mathcal{I}^{1/2-H} v$. Then we have the identity

j)-Z* = —V v Z 7 f = —J T A T v = —J T A T vi- H i , (4.19) 

where $\alpha_H$ is as in (4.2). In line with the notation from [Nua06], we define $s \mapsto \mathscr{D}_s Z^z_t$
to be the stochastic process such that the relation

$$\mathscr{D}_{\tilde v} Z^z_t = \int_0^t \tilde v(s)\, \mathscr{D}_s Z^z_t\,ds ,$$

holds for every $\tilde v \in L^2$. Comparing this with (4.16), we see that one has the identity

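$$\mathscr{D}_s Z^z_t = c \int_s^t (r-s)^{H-3/2} \big( J_{s,t} V(Z^z_s) - J_{r,t} V(Z^z_r) \big)\,dr + \frac{2c}{1-2H}\,(t-s)^{H-1/2} J_{s,t} V(Z^z_s) , \tag{4.20}$$

for some fixed constant $c$. (Furthermore, $\mathscr{D}_s Z^z_t = 0$ for $s > t$.) In general, we can
rewrite this as

$$\mathscr{D} Z^z_t = \mathcal{D}^{1/2-H} D Z^z_t , \tag{4.21}$$

where $D_s Z^z_t$ is as in (4.18), with $D_s Z^z_t = 0$ for $s > t$, and $\mathcal{D}^{1/2-H}$ is the linear operator
given by

$$(\mathcal{D}^{1/2-H} f)(s) = c \int_s^\infty (r-s)^{H-3/2} \big( f(r) - f(s) \big)\,dr .$$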

The aim of this section is to obtain a priori bounds on the higher-order Malliavin
derivatives of the solution. As a first step, we obtain pointwise bounds on multiple
derivatives of the solution map. In view of (4.18), we will need to compute $D_s J_{0,t}$ in
order to obtain such bounds. At this stage, let us put indices back into the various
expressions in order to clarify their precise meaning.
We will use Einstein's convention of summation over repeated indices and we write $D^i$
for the derivative with respect to the $i$th component of the driving noise $(X, \mathbb{X})$. It is
clear that $D_s J_{0,t} = 0$ for $t < s$. Furthermore, we see from (4.11a) that

$$D^i_s J^{k\ell}_{0,s} = D_m V^k_i(Z_s)\, J^{m\ell}_{0,s} . \tag{4.22}$$

For $t > s$, we formally differentiate (4.11a) and we use the identity (4.18) to obtain the
rough evolution equation

$$\begin{aligned} dD^i_s J^{k\ell}_{0,t} ={}& D_m V^k_j(Z_t)\, D^i_s J^{m\ell}_{0,t}\, dX^j(t) + D_m V^k_0(Z_t)\, D^i_s J^{m\ell}_{0,t}\, dt \\ &+ D^2_{mn} V^k_j(Z_t)\, J^{m\ell}_{0,t} J^{no}_{s,t} V^o_i(Z_s)\, dX^j(t) + D^2_{mn} V^k_0(Z_t)\, J^{m\ell}_{0,t} J^{no}_{s,t} V^o_i(Z_s)\, dt . \end{aligned}$$

Note now that this is a linear inhomogeneous equation for $D^i_s J^{k\ell}_{0,t}$, where the linear part
has exactly the same structure as (4.11a). As a consequence, we can solve it using the
variation of constants formula which, when combined with (4.22), yields the expression

$$\begin{aligned} D^i_s J^{k\ell}_{0,t} ={}& J^{kp}_{s,t}\, D_m V^p_i(Z_s)\, J^{m\ell}_{0,s} + \int_s^t J^{kp}_{r,t}\, D^2_{mn} V^p_j(Z_r)\, J^{m\ell}_{0,r} J^{no}_{s,r} V^o_i(Z_s)\, dX^j(r) \\ &+ \int_s^t J^{kp}_{r,t}\, D^2_{mn} V^p_0(Z_r)\, J^{m\ell}_{0,r} J^{no}_{s,r} V^o_i(Z_s)\, dr . \end{aligned} \tag{4.23}$$

A similar identity also holds for $D_s J_{0,t}^{-1}$, but the precise form of this expression is
unimportant as will be seen presently.

Denote by $\mathcal{T}$ the space of finite rooted trees and by $\mathcal{F}$ the space of finite forests
(unordered finite collections of trees, allowing for repetitions). Formally, we denote by
$(\tau_1, \ldots, \tau_k)$ the forest consisting of the trees $\tau_1, \ldots, \tau_k$. For $F = (\tau_1, \ldots, \tau_k) \in \mathcal{F}$, we
also write $[F] = [\tau_1, \ldots, \tau_k] \in \mathcal{T}$ for the tree obtained by gluing the roots of the trees
of $F$ to a common new root.
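One possible way to encode these unordered trees and forests in code is sketched below. It assumes a representation in which a tree is identified with the forest of subtrees hanging off its root, stored as a sorted tuple so that the order of the trees does not matter. The names `forest`, `glue`, `BULLET` and `size` are hypothetical helpers introduced only for this illustration.

```python
def forest(*trees):
    """An unordered collection of trees (repetitions allowed), stored as a sorted tuple."""
    return tuple(sorted(trees))

def glue(f):
    """[F]: attach the roots of all trees of the forest f to a common new root.

    A tree is represented by the forest of subtrees below its root, so the
    glued tree is encoded by the forest itself.
    """
    return forest(*f)

# The tree consisting of a root only has no subtrees.
BULLET = forest()

def size(tree):
    """Number of nodes of a tree (the root plus the nodes of all its subtrees)."""
    return 1 + sum(size(child) for child in tree)
```

For instance, `glue(forest(BULLET, BULLET))` is the tree with a root and two leaves, and `forest(a, b) == forest(b, a)` holds for any trees `a` and `b`, matching the convention that forests are unordered.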

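For any forest $F \in \mathcal{F}$, we then build a sequence of subsets $\mathcal{V}^k_F \subset \mathcal{C}^\gamma_X([0,T], \mathbb{R})$
with $k \ge 1$ in the following way. For the empty forest $()$, we set $\mathcal{V}^k_{()} = \{1\}$ for every
$k$. For $F = (\bullet)$, the forest consisting of one single tree, which itself consists only of a
root, we set

$$\mathcal{V}^k_{(\bullet)} = \big\{ J^{ij}_{0,\cdot},\ (J^{-1}_{0,\cdot})^{ij},\ D^\ell_{i_1 \ldots i_\ell} V^i_m(Z_\cdot) \,:\, \ell \in \{0, \ldots, k\} \big\} ,$$

where $m \in \{0, \ldots, d\}$, and the indices $i$, $j$, and $i_1, \ldots, i_\ell$ belong to $\{1, \ldots, n\}$. For
forests $F = (\tau_1, \ldots, \tau_m)$ consisting of more than one tree, we set

$$\mathcal{V}^k_{(\tau_1, \ldots, \tau_m)} = \{ Y_1 \cdots Y_m \,:\, Y_j \in \mathcal{V}^k_{(\tau_j)} \} . \tag{4.24}$$

In other words, the processes contained in $\mathcal{V}^k_{(\tau_1, \ldots, \tau_m)}$ are obtained by multiplying together
the processes contained in the $\mathcal{V}^k_{(\tau_j)}$. Finally, if $F$ consists of a single tree consisting
of more than just one root, so that $F = [G]$ for some forest $G$, we set

$$\mathcal{V}^k_{[G]} = \Big\{ \int_0^\cdot Y_s\,dX^i(s),\ \int_0^\cdot Y_s\,ds \,:\, Y \in \mathcal{V}^k_G,\ i \in \{1, \ldots, d\} \Big\} . \tag{4.25}$$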

This construction has the following feature: 

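Lemma 4.3 There exists a map $F \mapsto \mathcal{T}_F$ from $\mathcal{F}$ to $2^{\mathcal{F}}$, the set of subsets of $\mathcal{F}$, with the
following properties:

• The set $\mathcal{T}_F$ is finite for every $F \in \mathcal{F}$.

• For every $F \in \mathcal{F}$, $k \ge 1$, there exist coefficients $c^i_{Y,U,\bar U}$ taking values in $\{0, 1, -1\}$
such that the identity

$$D^i_s Y_t = \sum_{G, \bar G \in \mathcal{T}_F}\ \sum_{U \in \mathcal{V}^{k+1}_G}\ \sum_{\bar U \in \mathcal{V}^{k+1}_{\bar G}} c^i_{Y,U,\bar U}\, U_s\, \bar U_t$$

holds for every $Y \in \mathcal{V}^k_F$ and every $0 \le s < t \le T$.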
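Proof. Note first that, by writing $J_{r,t} = J_{0,t} J_{0,r}^{-1}$ and similarly for $J_{s,r}$, we see from
(4.23) that $D_s J_{0,t}$ can indeed be written as

$$D_s J_{0,t} = \sum_{G, \bar G \in \mathcal{T}_J}\ \sum_{U \in \mathcal{V}^{k+1}_G}\ \sum_{\bar U \in \mathcal{V}^{k+1}_{\bar G}} c_{U,\bar U}\, U_s\, \bar U_t ,$$

for some coefficients $c_{U,\bar U} \in \{0, 1, -1\}$, and for

$$\mathcal{T}_J = \{ (\bullet),\ (\bullet, \bullet, \bullet),\ (\bullet, [\bullet, \bullet, \bullet, \bullet, \bullet]) \} .$$

The same statement holds true for $D^i_s (J^{-1}_{0,t})^{k\ell}$. Furthermore, we have from (4.18) and
the chain rule the identity

$$D^j_s D^\ell_{i_1 \ldots i_\ell} V_i(Z_t) = D^{\ell+1}_{i_1 \ldots i_\ell q} V_i(Z_t)\, J^{qp}_{s,t}\, V^p_j(Z_s) ,$$

so that we have

$$D^j_s D^\ell_{i_1 \ldots i_\ell} V_i(Z_t) = \sum_{G, \bar G \in \mathcal{T}_D}\ \sum_{U \in \mathcal{V}^{k+1}_G}\ \sum_{\bar U \in \mathcal{V}^{k+1}_{\bar G}} c_{U,\bar U}\, U_s\, \bar U_t ,$$

for some coefficients $c_{U,\bar U} \in \{0, 1, -1\}$ (depending also on all the indices appearing
on the left hand side), and for

$$\mathcal{T}_D = \{ (\bullet, \bullet) \} .$$

It follows that we can indeed find a set $\mathcal{T}_{(\bullet)} = \mathcal{T}_J \cup \mathcal{T}_D$ with the two properties stated
in the lemma.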

For more complicated forests, the claim follows by building $\mathcal{T}_F$ recursively in the
following way. If $F = (\tau_1, \ldots, \tau_m)$ for trees $\tau_j$ such that $\mathcal{T}_{(\tau_j)}$ is known, we observe
that one has the identity

$$D^i_s\, Y_1(t) \cdots Y_m(t) = \sum_{j=1}^m Y_1(t) \cdots Y_{j-1}(t)\, D^i_s Y_j(t)\, Y_{j+1}(t) \cdots Y_m(t) . \tag{4.26}$$

As a consequence, if we write F (B G for the union of two forests and F Q G for the 
forest obtained by removing from F its subforest G, we can set 

m 

T( T1 ,...,T m ) = \J{T (Ti) , f e in) e r (T4) } . (4.27) 
»=i 

It follows from (4.26) and (4.24) that this definition does indeed ensure that the re- 
quested properties are satisfied. 

It remains to consider the case $F = (\tau)$ for some non-trivial tree $\tau$. In this case,
there exists a forest $G$ such that $\tau = [G]$ and elements in $\mathcal{V}^k_F$ are given by (4.25). Note
now that one has the identities

$$D^i_s \int_0^t Y_r\,dX^j(r) = \delta_{ij} Y_s + \int_0^t D^i_s Y_r\,dX^j(r) ,$$
$$D^i_s \int_0^t Y_r\,dr = \int_0^t D^i_s Y_r\,dr .$$

As a consequence, if we set

$$\mathcal{T}_{[G]} = \mathcal{T}_G \cup \{ G, () \} \cup \{ ([H]) : H \in \mathcal{T}_G \} , \tag{4.28}$$

the requested properties are again satisfied by induction. Since every forest can be
built from elementary trees by the two operations considered in (4.27) and (4.28), this
concludes the proof. □
For our purpose, this has the following useful consequence. For a fixed final time
$T$, define the controlled rough path $\mathcal{J}^z \in \mathcal{C}^\gamma_X([0, T], \mathbb{R}^{2n^2+n})$ by

$$\mathcal{J}^z_t = (Z_t, J_{0,t}, J^{-1}_{0,t}) . \tag{4.29}$$

(Note that both $J$ and $J^{-1}$ also implicitly depend on the starting point $z$.) We then have
an a priori bound on the derivatives of the solution with respect to the driving noise in
terms of $\mathcal{J}^z$:

Proposition 4.4 Let $A_t$ denote any component of the vector $\mathcal{J}^z_t$. Under Assumption 4.1,
for every multiindex $\alpha = (\alpha_1, \ldots, \alpha_\ell)$ there exists a finite index set $\mathcal{T}_\alpha$ and elements
$F^k_j \in \mathcal{C}^\gamma_X([0, T], \mathbb{R})$ with $k \in \mathcal{T}_\alpha$ and $j \in \{1, \ldots, |\alpha| + 1\}$, such that the identity

$$D^{\alpha_1}_{s_1} \cdots D^{\alpha_\ell}_{s_\ell} A_t = \sum_{k \in \mathcal{T}_\alpha} F^k_1(s_1) \cdots F^k_\ell(s_\ell)\, F^k_{\ell+1}(t) , \tag{4.30}$$

holds for every $0 \le s_1 < \ldots < s_\ell < t \le T$.

Furthermore, there exist constants $M$ and $p$ depending only on $\alpha$ and $T$ such that
the bound

$$\|F^k_j\|_{X,\gamma} \le M \big( 1 + \|(\mathcal{J}^z, \mathcal{J}^{z\prime})\|_{X,\gamma} \big)^p ,$$

holds for every $k$ and $j$.

Proof. The first claim follows immediately from Lemma 4.3 by induction on $|\alpha|$. The
second claim follows from the construction of the sets $\mathcal{V}^k_F$, combined with Lemma 2.4
and Theorem 2.2. □

In the particular case when $X$ is fractional Brownian motion with Hurst parameter
$H$, it follows from Proposition 4.4 that, if we consider the Malliavin derivatives
with respect to the underlying Wiener process as at the beginning of this section, we
have the following bound:

Theorem 4.5 As above, let $A_t$ denote any component of the vector $\mathcal{J}^z_t$ and let $(X, \mathbb{X})$
be fractional Brownian motion with Hurst parameter $H \in (\frac13, \frac12)$. Under Assumption 4.1,
for every multiindex $\alpha = (\alpha_1, \ldots, \alpha_\ell)$, every $\gamma \in (\frac13, H)$, every $\delta > 0$, and
every $T > 0$, there exist constants $M$ and $p$ such that the bound

$$|\mathscr{D}^{\alpha_1}_{s_1} \cdots \mathscr{D}^{\alpha_\ell}_{s_\ell} A_{s_{\ell+1}}| \prod_{j=1}^{\ell} |s_{j+1} - s_j|^{1-2H+\delta} \le M \big( 1 + \|(\mathcal{J}^z, \mathcal{J}^{z\prime})\|_{X,\gamma} \big)^p ,$$

holds uniformly for all $0 \le s_1 < \ldots < s_{\ell+1} \le T$. Furthermore, the exponent $p$ can
be chosen to depend only on $|\alpha|$.

Remark 4.6 Since the function $t \mapsto t^{2H-1-\delta}$ is square integrable near the origin for $\delta$
sufficiently small, the random variable $A_t$ belongs to the stochastic Sobolev space $\mathbb{D}^\infty_{\mathrm{loc}}$.
If furthermore $\mathbf{E}\|(\mathcal{J}^z, \mathcal{J}^{z\prime})\|^p_{X,\gamma} < \infty$ for every $p$, then $A_t$ belongs to the stochastic
Sobolev space $\mathbb{D}^\infty$. See [Nua06, p. 49] for the definitions of $\mathbb{D}^\infty_{\mathrm{loc}}$ and $\mathbb{D}^\infty$.

Proof. Consider now $s_1 < \ldots < s_\ell$ to be fixed and consider, for $j = 0, \ldots, \ell$, the
sequence of functions

$$F^{(j)}(r_{j+1}, \ldots, r_\ell) = \mathscr{D}_{s_1} \cdots \mathscr{D}_{s_j} D_{r_{j+1}} \cdots D_{r_\ell} A_t ,$$






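so that our aim is to obtain a bound on $F^{(\ell)}$. Note that, by (4.21), the $F^{(j)}$ satisfy the
recursive formula

$$F^{(j)}(r_{j+1}, \ldots, r_\ell) = c \int_{s_j}^\infty (r - s_j)^{H-3/2} \big( F^{(j-1)}(r, r_{j+1}, \ldots, r_\ell) - F^{(j-1)}(s_j, r_{j+1}, \ldots, r_\ell) \big)\,dr . \tag{4.31}$$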

We claim now that, for every j, there exists an index set Tj and a family of functions 
F-*' ) such that, for Sj < r J+ i < . . . < re < t, one has the identity 

t 

keT 3 i=j+i 

Furthermore, for every /3 G (| — H,H), there exists a constant M independent of 
si, . . . , s< such that these functions satisfy the bound 

II II^MIw < M[[ - ^""^(l + \\(J z ,J z ')\\x,,) P , (4.33) 

for some fixed p > 0. Here, where we denote by H-FH/jj the C^-norm (not seminorm!) 
of F, restricted to the interval [Sj + i,t]. In the special case j = I, this is just 
Once we show that (4.32) and (4.33) hold, the proof is complete since the special case 
j = I and the choice = -i — H + S yields the stated claim for 6 sufficiently small. 
For larger values of S, the claim can easily be reduced to that for small 8. 

Note furthermore that F^\rj + i, . . . , r^) = if there exists i > j such that > t 
and that the function is symmetric under permutations of its arguments. As a 
consequence, (4.32) is sufficient to determine F^\ 

The proof now goes by induction over $j$. For $j = 0$, we have

$$F^{(0)}(r_1, \ldots, r_\ell) = D_{r_1} \cdots D_{r_\ell} A_t ,$$

which is indeed of the form (4.32) by Proposition 4.4. In this case, the bound (4.33)
reduces to the statement that the $F^{(0,k)}_i$ are $\beta$-Hölder continuous, which is also a consequence
of Proposition 4.4. In order to make use of the recursion (4.31), we have to
rewrite it in such a way that the arguments of $F^{(j-1)}$ are always ordered. Using the
recursion hypothesis, we then have the identity



F«\r j+U . . . ,r/) = c Yl - (if -1,fc) (r) - if _1,fc) (* J -)) dr 

feeT,_i •' s i 

i>j 

+ c J2 2 f r,+ \r-83) H -*FV- 1,k \r)dr 



keTj-t i>j 
i-1 

x ; 

q=j q=i+l 

2c 



1 - 2H 

i>j 
$= T_1 + T_2 + T_3$ .
Rewriting the integral from to n+i appearing in T 2 as 

f\r- Sj ) H -iFl 3 - 1M (r)dr- f (r - s 3 ) H -i F^~^ k \r) dr , 

we see that is indeed again of the form (4.32). It remains to show that the bound 
(4.33) holds. To show that it holds for Ti, write 

G k (s) = f (r - Sj ) H -i(F^ "^(r) - if _1,fc) (*i)) dr , 

so that one has, for s > Sj, the bound 

\d s G k (s)\ <(s- sj^-^WFf-^Wpj-! . 
In particular, one has for s > Sj + i the bound 

\d s G k (s)\ < (s j+1 - Sj ) H -i(s - Sjf-'WF^-^yj^ . 
Furthermore, we obtain in a similar way the bound 

|G fc (* j+ i)| < M(s j+1 Sj ) H -^\\F^ - 1 ' fe) || /3lj -i , 
for some constant M, so that a straightforward calculation yields 

||G fc || w < M(s j+1 s^WF^y^ , 

for some constant M. The requested bound on T\ (actually a bound that it better than 
requested) then follows at once. 

To bound T 2 , we proceed similarly by setting 

G k (s)= [ (r-sj^-iF^^dr, 

J s 

and noting that G k (t) = and 

\d s G k (s)\ < (s j+1 Sj ) H -^(s sjf-'WFt^hj-i ■ 
It follows as above that 

||G fc || w < M(s j+1 sj^-^WFt^hj-i , 
as requested. Finally, the bound on T 3 follows in the same way. □ 

5 Regularity of laws 

Our aim in this section is to show that, if the vector fields $V$ satisfy Hörmander's
celebrated Lie bracket condition (see below), then the Malliavin matrix of the process
$Z_t$ is almost surely invertible and to obtain quantitative bounds on its lowest eigenvalue.

In order to state Hörmander's condition, we define recursively the families of vector
fields

$$\mathcal{V}_1 = \{ V_k : k \ge 1 \} , \qquad \mathcal{V}_{n+1} = \mathcal{V}_n \cup \{ [U, V_k] : U \in \mathcal{V}_n,\ k \ge 0 \} ,$$

where $[U, V]$ denotes the Lie bracket between the vector fields $U$ and $V$. Note that
under Assumption 4.1, the elements in $\mathcal{V}_n$ also have derivatives of all orders that grow
at most polynomially. We now formulate Hörmander's bracket condition [Hör67]:

Assumption 5.1 For every $z \in \mathbb{R}^n$, there exists $N \in \mathbb{N}$ such that the identity

$$\mathrm{span}\{ U(z) : U \in \mathcal{V}_N \} = \mathbb{R}^n \tag{5.1}$$

holds.
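To make the bracket condition concrete, the following sketch checks it numerically for a hypothetical example on $\mathbb{R}^3$ with $V_1 = (1, 0, 0)$ and $V_2 = (0, 1, z_1)$ and no drift. It assumes the sign convention $[U, V] = DV\,U - DU\,V$ for the Lie bracket (the opposite convention only flips a sign and does not affect the span) and uses finite differences for the derivatives. None of these choices come from the text.

```python
def jacobian(field, z, eps=1e-5):
    """Central finite-difference Jacobian of a vector field at the point z."""
    n = len(z)
    cols = []
    for j in range(n):
        zp, zm = list(z), list(z)
        zp[j] += eps
        zm[j] -= eps
        fp, fm = field(zp), field(zm)
        cols.append([(fp[i] - fm[i]) / (2 * eps) for i in range(n)])
    # cols[j][i] is the derivative of component i in direction j.
    return [[cols[j][i] for j in range(n)] for i in range(n)]

def lie_bracket(U, V):
    """[U, V](z) = DV(z) U(z) - DU(z) V(z) (assumed sign convention)."""
    def bracket(z):
        DU, DV = jacobian(U, z), jacobian(V, z)
        u, v = U(z), V(z)
        n = len(z)
        return [sum(DV[i][k] * u[k] - DU[i][k] * v[k] for k in range(n))
                for i in range(n)]
    return bracket

def det3(a, b, c):
    """Determinant of the 3x3 matrix with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

# Hypothetical vector fields: V1 and V2 alone span only a plane at each point.
V1 = lambda z: [1.0, 0.0, 0.0]
V2 = lambda z: [0.0, 1.0, z[0]]
V12 = lie_bracket(V1, V2)   # equals (0, 0, 1) for these fields
```

At any point `z`, `det3(V1(z), V2(z), V12(z))` is close to 1, so the three vectors span $\mathbb{R}^3$ and the condition holds with one level of brackets in this example.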

It is well-known from the works of Malliavin, Bismut, Kusuoka, Stroock and others
[Mal78, Bis81, KS84, KS85, KS87, Nor86, Mal97, Nua06] that when the driving
noise $X$ is Brownian motion, one way of proving the smoothness of the law of $Z_t$
under Hörmander's condition is to first show the invertibility of the "reduced Malliavin
matrix"$^2$

$$C_T \stackrel{\text{def}}{=} \mathcal{A}_T \mathcal{A}_T^* = \int_0^T J_{0,s}^{-1}\, V(Z_s)\, V(Z_s)^* (J_{0,s}^{-1})^*\,ds . \tag{5.2}$$

Recall that the matrix norm of a symmetric matrix is equal to its largest eigenvalue.
Since $C_T$ is a symmetric matrix, one can write the norm of its inverse as

$$\|C_T^{-1}\|^{-1} = \inf_{\|\varphi\|=1} \langle\varphi, C_T \varphi\rangle , \qquad \varphi \in \mathbb{R}^n . \tag{5.3}$$
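The identity relating the norm of the inverse to the smallest value of the quadratic form can be checked on a small example. The sketch below does this for a hypothetical symmetric positive definite $2 \times 2$ matrix, minimising $\langle v, Cv \rangle$ over unit vectors by a brute-force scan of angles. The function names and the test matrix are illustrative assumptions.

```python
import math

def quad_form_min(c, steps=100000):
    """Minimum over unit vectors v = (cos a, sin a) of <v, C v> for symmetric 2x2 C."""
    best = float("inf")
    for k in range(steps):
        a = math.pi * k / steps
        v0, v1 = math.cos(a), math.sin(a)
        q = c[0][0] * v0 * v0 + 2 * c[0][1] * v0 * v1 + c[1][1] * v1 * v1
        best = min(best, q)
    return best

def inverse_norm(c):
    """Operator norm of C^{-1}, computed as the largest eigenvalue of the
    symmetric positive definite matrix C^{-1}."""
    det = c[0][0] * c[1][1] - c[0][1] ** 2
    inv = [[c[1][1] / det, -c[0][1] / det],
           [-c[0][1] / det, c[0][0] / det]]
    tr = inv[0][0] + inv[1][1]
    d = inv[0][0] * inv[1][1] - inv[0][1] ** 2
    return tr / 2 + math.sqrt(tr * tr / 4 - d)

C = [[2.0, 1.0], [1.0, 3.0]]   # hypothetical positive definite example
```

For this matrix, `quad_form_min(C)` and `1 / inverse_norm(C)` agree up to the resolution of the angular grid, both being the smallest eigenvalue $(5 - \sqrt{5})/2$ of `C`.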

Before we turn to our bound on the inverse of $C_T$, let us introduce some notation.
For any smooth vector field $U$, define the process $Z_U(t) = J_{0,t}^{-1} U(Z_t)$, and set furthermore

$$\mathcal{R} = 1 + L_\theta(X)^{-1} + \|(X, \mathbb{X})\|_\gamma + \|(\mathcal{J}^z, \mathcal{J}^{z\prime})\|_{X,\gamma} + |z| , \tag{5.4}$$

where $\mathcal{J}^z$ is as in (4.29). Here, we fix a "roughness exponent" $\theta > H$ which will
appear in subsequent statements.

Lemma 5.2 Fix a final time $T > 0$. Under Assumption 4.1, there exist constants
$c, a > 0$ and $M > 0$ such that the bound

$$\|\langle\varphi, Z_U(\cdot)\rangle\|_\infty \le M \mathcal{R}^c\, |\langle\varphi, C_T \varphi\rangle|^a ,$$

holds for all $U \in \mathcal{V}_1$, all $\varphi \in \mathbb{R}^n$ such that $\|\varphi\| = 1$, all initial conditions $z$ and all
$(X, \mathbb{X}) \in \mathcal{D}^\gamma_g([0, T], \mathbb{R}^d)$.

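Proof. By definition we have

$$\langle\varphi, C_T \varphi\rangle = \sum_{i=1}^d \int_0^T \langle\varphi, J_{0,s}^{-1} V_i(Z_s)\rangle^2\,ds = \sum_{i=1}^d \|\langle\varphi, Z_{V_i}(\cdot)\rangle\|^2_{L^2[0,T]} . \tag{5.5}$$

To obtain an upper bound of order $|\langle\varphi, C_T \varphi\rangle|^a$ on the supremum norm, our main tool
is the interpolation inequality

$$\|f\|_\infty \le 2 \max\Big( T^{-1/2} \|f\|_{L^2[0,T]},\ \|f\|^{\frac{2\gamma}{2\gamma+1}}_{L^2[0,T]}\, \|f\|^{\frac{1}{2\gamma+1}}_\gamma \Big) , \tag{5.6}$$

which holds for every $\gamma$-Hölder continuous function $f : [0, T] \to \mathbb{R}$ (see for example
[HP11, Lemma A.3]). Since in our case the final time $T$ is fixed, the $L^2$ norm is
controlled by the $\gamma$-Hölder norm, so that

$$\|\langle\varphi, Z_{V_i}(\cdot)\rangle\|_\infty \le M\, \|\langle\varphi, Z_{V_i}(\cdot)\rangle\|^{\frac{2\gamma}{2\gamma+1}}_{L^2[0,T]}\, \|\langle\varphi, Z_{V_i}(\cdot)\rangle\|^{\frac{1}{2\gamma+1}}_{\mathcal{C}^\gamma} .$$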
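The interpolation inequality used in this step can be sanity-checked numerically. The sketch below assumes it has the form $\|f\|_\infty \le 2\max\big(T^{-1/2}\|f\|_{L^2}, \|f\|_{L^2}^{2\gamma/(2\gamma+1)} \|f\|_\gamma^{1/(2\gamma+1)}\big)$, with $\|f\|_\gamma$ the Hölder seminorm, and evaluates both sides on a grid for the hypothetical test function $f(t) = \sqrt{t}$ with $\gamma = 1/2$. Grid values only approximate the true norms, so this is an illustration rather than a verification.

```python
import math

def check_interpolation(f, gamma, T=1.0, n=400):
    """Return (sup norm, right-hand side of the interpolation bound) on a uniform grid."""
    ts = [T * k / n for k in range(n + 1)]
    vals = [f(t) for t in ts]
    sup = max(abs(v) for v in vals)
    # Trapezoidal approximation of the squared L^2 norm on [0, T].
    l2 = math.sqrt(sum((vals[k] ** 2 + vals[k + 1] ** 2) / 2 for k in range(n)) * T / n)
    # Hoelder seminorm restricted to grid points.
    hol = max(abs(vals[i] - vals[j]) / (ts[j] - ts[i]) ** gamma
              for i in range(n + 1) for j in range(i + 1, n + 1))
    e = 2 * gamma / (2 * gamma + 1)
    rhs = 2 * max((T ** -0.5) * l2, (l2 ** e) * (hol ** (1 - e)))
    return sup, rhs
```

For `check_interpolation(math.sqrt, 0.5)` the supremum is 1 while the right-hand side is roughly 1.68, consistent with the bound.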

$^2$This is a slight misnomer since our SDE is driven by fractional Brownian motion, rather than Brownian
motion. One can actually rewrite the solution as a function of an underlying Brownian motion by making
use of the representation (4.2), but the associated Malliavin matrix has a slightly more complicated relation to
$C_T$ than usual. Still, it will be useful to first obtain a bound on the inverse of $C_T$.

Since the vector fields $V_i$ have derivatives with at most polynomial growth by assumption,
we obtain immediately from Lemma 2.4 the bound

$$\|\langle\varphi, Z_{V_i}(\cdot)\rangle\|_{\mathcal{C}^\gamma} \le M \mathcal{R}^a , \tag{5.7}$$

for some exponent $a$. Combining this with (5.5), the claim follows at once. □



The next lemma involves an iterative argument (similar in spirit to [KS85, Nor86,
BH07]) to show that a similar bound holds with $V_j$ replaced by any vector field obtained
by taking finitely many Lie brackets between the $V_j$'s.

Lemma 5.3 Fix a final time $T > 0$. Under Assumption 4.1, for every $i \ge 1$, there exist
constants $c_i, a_i > 0$ and $M > 0$ such that

$$\|\langle\varphi, Z_U(\cdot)\rangle\|_\infty \le M \mathcal{R}^{c_i}\, |\langle\varphi, C_T \varphi\rangle|^{a_i} ,$$

for every $U \in \mathcal{V}_i$, every $\varphi \in \mathbb{R}^n$ such that $\|\varphi\| = 1$, every initial condition $z$ and every
$(X, \mathbb{X}) \in \mathcal{D}^\gamma_g([0, T], \mathbb{R}^d)$.

Proof. The proof goes by induction over $i$. We already know from Lemma 5.2 that the
statement holds for $i = 1$. Assume now that it holds for some $i \ge 1$ and let us show
that it holds for $i + 1$. For any $t \le T$ and $U \in \mathcal{V}_i$, a simple application of the chain rule
(which holds since $(X, \mathbb{X})$ is assumed to be a geometric rough path) yields

$$\langle\varphi, Z_U(t)\rangle = \langle\varphi, U(z)\rangle + \int_0^t \langle\varphi, Z_{[V_0, U]}(s)\rangle\,ds + \sum_{j=1}^d \int_0^t \langle\varphi, Z_{[V_j, U]}(s)\rangle\,dX^j_s , \tag{5.8}$$

where the second integral is a rough integral as in Theorem 2.2.

First we derive a priori bounds on the two integrands of (5.8) and then apply Theorem 3.4.
It follows from Lemma 2.4 and Assumption 4.1 that

$$\|\langle\varphi, Z_{[V_j, U]}(\cdot)\rangle\|_{X,\gamma} \le M \mathcal{R}^a , \qquad j = 0, \ldots, d , \tag{5.9}$$

so that a similar bound holds on $\|\langle\varphi, Z_{[V_j, U]}(\cdot)\rangle\|_\gamma$.

By the induction hypothesis, for every $U \in \mathcal{V}_i$ we have the bound $\|\langle\varphi, Z_U(\cdot)\rangle\|_\infty \le
M \mathcal{R}^{c_i} |\langle\varphi, C_T \varphi\rangle|^{a_i}$ for some constants $a_i, c_i$. Applying Theorem 3.4 to (5.8) and
using the a priori bound (5.9), we conclude that there exist constants $a_{i+1}, c_{i+1}$ such
that

$$\|\langle\varphi, Z_{[U, V_\ell]}(\cdot)\rangle\|_\infty \le M \mathcal{R}^{c_{i+1}}\, |\langle\varphi, C_T \varphi\rangle|^{a_{i+1}} ,$$

for $\ell = 0, \ldots, d$. Since $\mathcal{V}_{i+1}$ contains precisely the vector fields $[U, V_\ell]$, this concludes
the proof. □

Now we combine the above two lemmas and Hörmander's hypothesis, Assumption 5.1,
to obtain lower bounds on the smallest eigenvalue of $C_T$.

Proposition 5.4 Assume that Assumptions 4.1 and 5.1 hold. Fix $T > 0$ and let the
matrix $C_T$ and the quantity $\mathcal{R}$ be as defined in (5.2) and (5.4) respectively. Then there
exists a constant $c > 0$ such that the bound

$$\inf_{\|\varphi\|=1} |\langle\varphi, C_T \varphi\rangle| \ge M \mathcal{R}^{-c} , \tag{5.10}$$

holds uniformly over every driving path $(X, \mathbb{X}) \in \mathcal{D}^\gamma_g$ and every initial condition $z$.

Remark 5.5 We emphasize again that (5.10) yields a lower bound on the eigenvalues
of $C_T$ that is not probabilistic in nature. All the probabilistic cancellations that take
place in the classical probabilistic proofs of Hörmander's theorem are "hidden" in the
strict positivity of $L_\theta(X)$ and the boundedness of $\|(X, \mathbb{X})\|_\gamma$.



Proof of Proposition 5.4. Let $N \in \mathbb{N}$ be such that

K = l i ^ lzll i f, E \{<p,mz))\ 2 >o. (5.ii) 

\z\<n lMI=i r ~- 

The existence of such an N follows from Assumption 5.1 and the smoothness of the 
vector fields V. 

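Note now that, considering the right hand side at time $0$, we see that

$$|\langle\varphi, U(z_0)\rangle| \le \|\langle\varphi, Z_U(\cdot)\rangle\|_\infty .$$

From Lemmas 5.2 and 5.3, there then exist constants $c_N$, $a_N$ such that

$$K \le \inf_{\|\varphi\|=1} \sup_{U \in \mathcal{V}_N} \|\langle\varphi, Z_U(\cdot)\rangle\|_\infty \le M \mathcal{R}^{c_N} \inf_{\|\varphi\|=1} |\langle\varphi, C_T \varphi\rangle|^{a_N} , \tag{5.12}$$

which is precisely the required bound. □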

Now let $\mathcal{M}_T$ be the Malliavin matrix of the map $W \mapsto Z^z_T$, where $W$ is the underlying
Wiener process from the representation (4.2). Then we have the following
pathwise bound on $\mathcal{M}_T$.

Theorem 5.6 Under the assumptions of Proposition 5.4, there exists a constant $c_1 > 0$
such that the bound

$$\inf_{\|\varphi\|=1} |\langle\varphi, \mathcal{M}_T \varphi\rangle| \ge M \mathcal{R}^{-c_1} , \tag{5.13}$$

holds for every driving path $(X, \mathbb{X}) \in \mathcal{D}^\gamma_g$ and every initial condition with $|z| \le \mathcal{R}$.

Proof. By virtue of (4.19), we have the identity

$$|\langle\varphi, \mathcal{M}_T \varphi\rangle| = \|(\mathcal{D}^{1/2-H})^* \mathcal{A}_T^* J_{0,T}^* \varphi\|^2_{L^2[0,T]} , \tag{5.14}$$

where $(\mathcal{D}^{1/2-H})^*$ is the $L^2[0, T]$ adjoint of the operator $\mathcal{D}^{1/2-H}$ defined in (4.1). Notice
that $\mathcal{I}^{1/2-H} : L^2[0, T] \to L^2[0, T]$ is a bounded operator and since $\mathcal{I}^{1/2-H}$
and $\mathcal{D}^{1/2-H}$ are inverses of each other, we conclude that the operator $(\mathcal{D}^{1/2-H})^*$ has a
bounded inverse in $L^2[0, T]$. Thus

$$\|(\mathcal{D}^{1/2-H})^* \mathcal{A}_T^* J_{0,T}^* \varphi\|^2_{L^2[0,T]} \ge M \|\mathcal{A}_T^* J_{0,T}^* \varphi\|^2_{L^2[0,T]} = M \langle J_{0,T}^* \varphi, C_T J_{0,T}^* \varphi\rangle ,$$

which, from Proposition 5.4, is bounded from below by

$$M \mathcal{R}^{-c} \|J_{0,T}^* \varphi\|^2 \ge M \mathcal{R}^{-c_1} \|\varphi\|^2 ,$$

where the last bound is a consequence of the fact that $\|J_{0,T}^{-1}\| \le M \mathcal{R}$. □
5.1 Probabilistic Bounds and Smoothness of Laws

Recall from (4.2) that the "future" evolution of the fBm conditional on the past $w_-$
may be expressed as

$$w_+ = \mathcal{G} w_- + \alpha_H \mathcal{D}^{1/2-H} W ,$$

where $\mathcal{G} w_-$ is the conditional expectation with the operator $\mathcal{G}$ given by (4.3). As in the
previous section, we will mostly be interested in the situation when $w_-$ is fixed and the
conditional law of the solution is considered.

One problem is that it is in general quite difficult to obtain moment bounds on the
Jacobian (and its inverse) for equations of the type (1.1) when the driving noise is only
$\gamma$-Hölder for some $\gamma > \frac13$ (rather than $\gamma > \frac12$). The best bounds obtained in [FV10b]
rule out a downright explosion of the Jacobian, but only yield logarithmic moments
in general. The very recent article [CLL11] obtains such moment bounds, but under
boundedness conditions that are stronger than Assumption 4.1. See also [FR11] for a
related result. We therefore state the moment bounds on the solution and its Jacobian
as an additional assumption:

Assumption 5.7 There exists an exponent $\zeta < 2$ and a seminorm $|||\cdot|||$ on $C(\mathbb{R}_-, \mathbb{R}^d)$
such that $|||w_-|||$ is almost surely finite and such that, for every $R > 0$ and every $p \ge 1$,
the bound

$$\mathbf{E}\big( \|(\mathcal{J}^z, \mathcal{J}^{z\prime})\|^p_{X,\gamma} \,\big|\, w_- \big) \le M \exp\big( M |||w_-|||^\zeta \big) , \tag{5.15}$$

holds for some constant $M$, uniformly over all initial conditions with $|z| \le R$.

Remark 5.8 Combining (5.15) with Fernique's theorem immediately yields the unconditioned
bound

$$\mathbf{E} \|(\mathcal{J}^z, \mathcal{J}^{z\prime})\|^p_{X,\gamma} \le M , \tag{5.16}$$

for any $p \ge 1$.

Now we combine the results above with the results from the previous section to obtain
probabilistic bounds on the inverse of the Malliavin matrix, under the additional
hypothesis that Assumption 5.7 holds.

Proposition 5.9 Let (1.1) be such that Assumptions 4.1, 5.1, and 5.7 are satisfied. Fix
$T > 0$ and let $\mathcal{M}_T$ be the Malliavin matrix as in (5.13).

Then, there exists a norm $|||\cdot|||$ such that $|||w_-||| < \infty$ almost surely and, for any
$R > 0$ and any $p \ge 1$, there exists $M$ such that the bound

$$\mathbf{P}\Big( \inf_{\|\varphi\|=1} \langle\varphi, \mathcal{M}_T \varphi\rangle \le \varepsilon \,\Big|\, w_- \Big) \le M e^{M |||w_-|||^\zeta} \varepsilon^p , \tag{5.17}$$

holds for all $\varepsilon \in (0, 1]$ and all initial conditions $z$ with $|z| \le R$. Here, the constant $\zeta$
is as in (5.15).

Similarly, the unconditional bound

$$\mathbf{P}\Big( \inf_{\|\varphi\|=1} \langle\varphi, \mathcal{M}_T \varphi\rangle \le \varepsilon \Big) \le M \varepsilon^p , \tag{5.18}$$

holds.
Proof. From Theorem 5.6 we deduce that for small enough $\varepsilon$,

$$\mathbf{P}\Big( \inf_{\|\varphi\|=1} |\langle\varphi, \mathcal{M}_T \varphi\rangle| \le \varepsilon \Big) \le \mathbf{P}\big( \mathcal{R} \ge \varepsilon^{-c_1} \big) ,$$

for some constant $c_1 > 0$. By Markov's inequality, for any $p \ge 1$, this expression is
bounded by

$$M \varepsilon^{p c_1}\, \mathbf{E} \mathcal{R}^p .$$

Now for any p > 1, from Lemma 3.9 and Assumption 5.7 it follows that 

Furthermore, it follows from (4.2) that $\mathbf{E} \|(X, \mathbb{X})\|^p_\gamma \le M e^{M |||w_-|||^\zeta}$, thus proving the
claim (5.17). The second claim then follows from Fernique's theorem. □

As an immediate corollary, we obtain that the inverse of the Malliavin matrix has all moments:

Corollary 5.10 Under the assumptions of Proposition 5.9, the matrix $\mathcal{M}_T$ is almost
surely invertible and, for any $R > 0$ and any $p \ge 1$,

$$\mathbf{E}\big( \|\mathcal{M}_T^{-1}\|^p \,\big|\, w_- \big) \le M e^{M |||w_-|||^\zeta} ,$$

uniformly over all initial conditions $z$ of (1.1) such that $|z| \le R$.

As a consequence of Proposition 5.9, we obtain the smoothness of the laws of $Z_t$
conditioned on an instance of the past $w_-$:

Theorem 5.11 Let (1.1) be such that Assumptions 4.1, 5.1, and 5.7 are satisfied.

Then, for every realization of the past $w_-$ with $|||w_-||| < \infty$, every initial condition
$z$ and every $t > 0$, the conditional distribution of $Z^z_t$ has a smooth density with respect
to Lebesgue measure. Furthermore, every derivative of this density has finite moments
of all orders with respect to the law of $w_-$, so that the unconditioned distribution of $Z^z_t$
also has moments of all orders.

Remark 5.12 The norm ||| • ||| appearing in the statement is the same as the one appear- 
ing in Assumption 5.7. 

Proof. Combining Theorem 4.5 with Assumption 5.7, we see that the random variable
$Z^z_t$ belongs to the space $\mathbb{D}^\infty$. The claim then immediately follows from the fact that
the Malliavin matrix has inverse moments of all orders [Nua06].

The claim about the moments of the density follows from the fact that, by (5.15),
$\|\mathcal{M}_t^{-1}\|$ and all Malliavin derivatives of $Z^z_t$ also have unconditional moments of all
orders. □

5.2 A cutoff argument 

While [CLL11] provides a large collection of examples for which Assumption 5.7 
holds, this condition is not always easy to check. In this section, we therefore provide
a cutoff argument that still allows us to show the existence of a density for the law
of the solutions to (1.1) under Hörmander's condition, without assuming that
Assumption 5.7 holds. Actually, we show slightly more than the mere existence of a density,
namely we show that the density can be approximated from below by a sequence of
smooth densities. More precisely, the main result of this section is the following:

Theorem 5.13 Assume that Assumptions 4.1 and 5.1 hold, and denote by $\mu_t$ the conditional
law of the solution to (1.1) at time $t > 0$, with fixed initial condition $z \in \mathbb{R}^n$.
Then, there exists an increasing sequence of positive measures $\mu_t^n$ with $C^\infty$ densities
$\varrho_t^n$ such that $\lim_{n\to\infty} \mu_t^n(A) = \mu_t(A)$ for every Borel set $A$. In particular, $\mu_t$ has a
density $\varrho_t$ with respect to Lebesgue measure and $\lim_{n\to\infty} \varrho_t^n(x) = \varrho_t(x)$ for
Lebesgue-almost every $x$.

Remark 5.14 The statement that $\varrho_t$ can be approximated from below by smooth functions
is strictly stronger than just $\varrho_t \in L^1$, which was already obtained in [CF10]. An
example of a density function that cannot be approximated in this way would be the
characteristic function of a Cantor set with positive Lebesgue measure.

Proof. The idea is to perform the following cutoff argument. For $\beta > 0$ real, $T > t$
and $q > 2$ an even integer, we define the function

$$A_{\beta,q,T}(X, \mathbb{X}) = \int_0^T\!\!\int_0^T \frac{|\delta X_{s,t}|^{2q} + |\tilde{\mathbb{X}}_{s,t}|^{q}}{|t-s|^{2\beta q}}\, ds\, dt, \tag{5.19}$$

where we denote by $\tilde{\mathbb{X}}$ the antisymmetric part of $\mathbb{X}$. This function has the following
desirable properties:

1. From the scaling of the covariance function for fractional Brownian motion and
the equivalence of moments for Gaussian measures, we conclude that if $(X, \mathbb{X})$
is fractional Brownian motion with Hurst parameter $H$, then $A_{\beta,q,T}(X, \mathbb{X})$ has
finite (conditional) moments of all orders, provided that $\beta < H$.

2. For every $\gamma \in (0, H)$ and every $\beta \in (\gamma, H)$, there exist $q > 0$ and $M > 0$ such
that

$$\|X\|_\gamma^{2q} + \|\mathbb{X}\|_{2\gamma}^{q} \le M A_{\beta,q,T}(X, \mathbb{X}). \tag{5.20}$$

A proof of this fact can be found in [FV10b, p. 149]. Note that since we assume
$(X, \mathbb{X})$ to be geometric, the symmetric part of $\mathbb{X}_{s,t}$ is given by $\delta X_{s,t} \otimes \delta X_{s,t}$, so
that it is indeed sufficient to control the increments of $X$ and the antisymmetric
part of $\mathbb{X}$.

3. For every $(X, \mathbb{X}) \in \mathcal{D}^\gamma$, the map

$$\mathcal{H}_{H,+} \ni h \mapsto A_{\beta,q,T}(\tau_h(X, \mathbb{X}))$$

is Fréchet differentiable to all orders [FV10b], where $\tau_h$ is the "translation map"
as defined in (4.5). In particular, the map $w_+ \mapsto A_{\beta,q,T}(X(w_+), \mathbb{X}(w_+))$
belongs to the space $\mathbb{D}^\infty$ of random variables that are Malliavin differentiable
of all orders with all Malliavin derivatives having moments of all orders. The
precise statement of this fact is given in Proposition A.1 in the appendix.

The proof is now straightforward. First of all, we let $\gamma < H$ be as in the previous
sections, let $\beta \in (\gamma, H)$, and fix $q$ large enough so that (5.20) holds. We also let
$\chi : \mathbb{R}_+ \to \mathbb{R}_+$ be a $C^\infty$ non-increasing cut-off function so that $\chi(\lambda) = 1$ for $\lambda \le 1$ and
$\chi(\lambda) = 0$ for $\lambda \ge 2$. With these definitions at hand, we set



$$\Psi_n(w_+) = \chi\big(n^{-1} A_{\beta,q,T}(X(w_+), \mathbb{X}(w_+))\big). \tag{5.21}$$

Fix furthermore $z \in \mathbb{R}^n$ and as before denote by $\Phi_t(z, w_+)$ the Itô map, so that
$\mu_t = \Phi_t^* \mathbf{P}$. We then set $\mu_t^n = \Phi_t^*(\Psi_n \mathbf{P})$. In other words, we have the identity

$$\mu_t^n(A) = \int_{\Phi_t^{-1}(A)} \Psi_n(w_+)\, \mathbf{P}(dw_+),$$

valid for every measurable set $A \subset \mathbb{R}^n$. Since $A_{\beta,q,T}$ is almost surely finite, we clearly
have $\mu_t^n(A) \nearrow \mu_t(A)$ for every measurable set $A$, so that the claim follows if we can
show that every $\mu_t^n$ has a smooth density. This in turn follows by Malliavin's lemma
[Nua06] if we are able to show that, for every bounded open set $K \subset \mathbb{R}^n$ and every
multi-index $\alpha$, there exists a constant $M$ such that the bound

$$\mathbf{E}\big(D^\alpha G(\Phi_t(z, w_+))\, \Psi_n(w_+)\big) \le M(w_-) \sup_x |G(x)|$$

holds uniformly over all test functions $G : \mathbb{R}^n \to \mathbb{R}$ that are $C^\infty$ and supported in $K$.

Let $\alpha = (\alpha_1, \alpha_2, \ldots, \alpha_k)$, $\alpha_i \in \{1, 2, \ldots, n\}$. Using the chain rule and the
integration by parts formula from Malliavin calculus, we have the identity

$$\mathbf{E}\big(D^\alpha G(\Phi_t(z, w_+))\, \Psi_n(w_+)\big) = \mathbf{E}\big(G(\Phi_t(z, w_+))\, H_\alpha(\Phi_t(z, w_+), \Psi_n(w_+))\big), \tag{5.22}$$

where the random variables $H_\alpha$ are defined as follows. For $\alpha = \phi$, the empty multi-index,
we set $H_\phi = \Psi_n$. Furthermore, given a random variable $G$ and an index $\alpha_1$, we
set

$$H_{\alpha_1}(G) = \mathcal{D}^*\Big(G \sum_{j=1}^n (\mathcal{M}_t^{-1})_{\alpha_1 j}\, \mathcal{D}\Phi_t^j(z, w_+)\Big). \tag{5.23}$$

With these definitions at hand, it is straightforward to see that, for $\alpha = (\alpha_1, \ldots, \alpha_k)$,
we have

$$H_\alpha = H_{\alpha_1}(H_{(\alpha_2, \ldots, \alpha_k)}).$$
Fortunately, all of these expressions can be controlled in the following way. Define the
set

$$S_n = \{w : A_{\beta,q,T}(X(w_+), \mathbb{X}(w_+)) < 2n\}. \tag{5.24}$$

It then follows from the local property of the Skorokhod integral [Nua06, Prop. 1.3.15]
that $H_\alpha(w_+) = 0$ for $w_+ \notin S_n$. As a consequence, we also have the identity

$$H_\alpha = \tilde H_{\alpha_1}(H_{(\alpha_2, \ldots, \alpha_k)}),$$

where

$$\tilde H_{\alpha_1}(G) = \mathcal{D}^*\Big(G \sum_{j=1}^n \Psi_{2n}\, (\mathcal{M}_t^{-1})_{\alpha_1 j}\, \mathcal{D}\big(\Psi_{2n} \Phi_t^j(z, w_+)\big)\Big).$$

Note now that, by Corollary A.2, Theorem 5.6, and Theorem 4.5, both $\Psi_{2n} (\mathcal{M}_t^{-1})_{\alpha_1 j}$
and $\Psi_{2n} \Phi_t(z, w_+)$ belong to the stochastic Sobolev space $\mathbb{D}^{\ell,p}$ for every $\ell, p \ge 1$,
uniformly over every $w_-$ such that $|||w_-|||_\gamma < R$ for any $R > 0$, where $|||w_-|||_\gamma$ was
defined in (4.8).

As a consequence, for $\alpha = (\alpha_1, \ldots, \alpha_k)$ and $\ell \ge 0$, we have the bound

EUDWff Q r <K(ii| W _i) (nv (m) H (a2 ,..., a j\ 2 n 1/2 , 

m<i+l 

where we denote by $\mathcal{D}^{(k)}$ the $k$th iterated Malliavin derivative and by $\|\cdot\|$ the $L^2$-norm.
Since $\Psi_n$ also belongs to $\mathbb{D}^{\ell,p}$ for every $\ell, p \ge 1$ by Corollary A.2, the claim then
follows. □

6 Ergodicity of SDEs driven by fBm

The aim of this section is to use the preceding results in order to obtain ergodicity 
results for stochastic differential equations driven by fractional Brownian motion. In 
order to do this, we make use of the abstract framework introduced in [Hai05] and 
further refined in [HO07, Hai09]. This allows us to introduce a notion of a "strong Feller
property" for a large class of equations driven by non-white noise, together with a
corresponding version of the Doob-Khasminskii theorem, stating that the strong Feller
property, combined with a form of topological irreducibility and a quasi-Markovian
property, is sufficient to deduce the uniqueness of an "invariant measure" in a suitable
sense.

In order to use this framework, we view solutions to (1.1) as a discrete-time Markov
process on a space of the type $\mathcal{W} \times \mathbb{R}^n$, where $\mathcal{W}$ contains all the information about
the driving noise $X$ required to solve (1.1) over a (fixed) time interval 1 and to predict
the law of its future evolution. In our case, it is natural to choose $\mathcal{W}$ to be of the form

$$\mathcal{W} = \mathcal{W}_- \oplus \mathcal{W}_+,$$

where $\mathcal{W}_-$ contains the "past" of the driving noise up to time 0, and $\mathcal{W}_+$ contains the
noise between times 0 and 1. The reason for splitting our space explicitly into two parts
is that in order to be able to give a meaning to solutions to (1.1), we consider the driving
noise as a rough path, i.e. we choose $\mathcal{W}_+ = \mathcal{D}_g^\gamma([0, 1], \mathbb{R}^d)$ for some $\gamma \in (\frac13, H)$.
(Recall that $\mathcal{D}_g^\gamma$ is the closure of the set of lifts of smooth functions in $\mathcal{D}^\gamma$.)

On the other hand, in order to recover the conditional law of fractional Brownian
motion given its past, iterated integrals are not needed and it is sufficient to retain
information about the path itself. Therefore, it makes sense to choose $\mathcal{W}_-$ in a way
similar to [Hai05], namely we choose $\mathcal{W}_- = \mathcal{W}_\gamma$ for some $\gamma < H$, where $\mathcal{W}_\gamma$ was
defined in (4.8). Denote as before by $\mathbf{P}_-$ the measure on $\mathcal{W}_\gamma$ such that the canonical
process is a fractional Brownian motion with Hurst parameter $H$ under $\mathbf{P}_-$.

For any given $w_- \in \mathcal{W}_-$, we now construct a measure on $\mathcal{W}_+$ as the law of a two-sided
fractional Brownian motion, conditioned on its past $w_-$, and enhanced with the
corresponding "area process". More precisely, denote by $\tilde{\mathbf{P}}_+$ the law of the stochastic
process $\{X_t\}_{t \in [0,1]}$, given by

$$X_t = \alpha_H \int_0^t (t - r)^{H - \frac12}\, dW_r, \tag{6.1}$$

where $W$ is a standard Wiener process and $\alpha_H$ is the constant appearing in (4.2). It
can be checked that the covariance of $\tilde{\mathbf{P}}_+$ satisfies the assumptions of [CQ02, FV10a],
so that it can be lifted in a canonical way to a measure $\mathbf{P}_+$ on $\mathcal{W}_+$.
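
To get a feeling for the process in (6.1), the following Python sketch simulates it by a midpoint discretisation of the stochastic integral. It is only an illustration and plays no role in the argument: the function names, the midpoint rule, and the normalisation $\alpha_H = 1$ are assumptions made here, since the constant from (4.2) is not restated in this section. The only fact used as a sanity check is the identity $\mathbf{E} X_t^2 = \alpha_H^2 t^{2H} / (2H)$, which follows from the Itô isometry.

```python
import math
import random


def simulate_rl_path(hurst, n_steps, horizon=1.0, alpha=1.0, seed=0):
    """Sample X_t = alpha * int_0^t (t - r)^(H - 1/2) dW_r on a uniform grid.

    Hypothetical helper: alpha stands in for the constant alpha_H of (4.2).
    """
    rng = random.Random(seed)
    dt = horizon / n_steps
    # Independent Wiener increments over the cells [j*dt, (j+1)*dt].
    dw = [rng.gauss(0.0, math.sqrt(dt)) for _ in range(n_steps)]
    path = [0.0]
    for k in range(1, n_steps + 1):
        t = k * dt
        # Evaluate the kernel at cell midpoints to avoid the singularity at r = t.
        x = sum((t - (j + 0.5) * dt) ** (hurst - 0.5) * dw[j] for j in range(k))
        path.append(alpha * x)
    return path


def discrete_variance(hurst, n_steps, horizon=1.0, alpha=1.0):
    """Variance of the discretised process at the final time."""
    dt = horizon / n_steps
    return alpha ** 2 * sum(
        (horizon - (j + 0.5) * dt) ** (2.0 * hurst - 1.0) * dt
        for j in range(n_steps)
    )


def exact_variance(hurst, t=1.0, alpha=1.0):
    """Closed form alpha^2 * t^(2H) / (2H) obtained from the Ito isometry."""
    return alpha ** 2 * t ** (2.0 * hurst) / (2.0 * hurst)
```

For $H < \frac12$ the kernel is singular at $r = t$, so the midpoint rule converges slowly; a finer grid or an exact cell average of the kernel would be the natural refinement.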

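With this definition at hand, we define a Markov transition kernel $\mathcal{P}$ from $\mathcal{W}_-$ to
$\mathcal{W}_+$ by

$$\mathcal{P}(w_-, \cdot) = \tau^*_{\mathcal{G} w_-} \mathbf{P}_+.$$

It follows from (4.10), (4.6), and (4.7) that $\mathcal{P}$ is Feller. Furthermore, it determines a
measure $\mathbf{P}$ on $\mathcal{W} = \mathcal{W}_- \times \mathcal{W}_+$ in a natural way by

$$\mathbf{P}(dw_- \times dw_+) = \mathbf{P}_-(dw_-)\, \mathcal{P}(w_-, dw_+).$$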

It follows from our construction that if we denote by $\Pi : \mathcal{W} \to \mathcal{C}((-\infty, 1], \mathbb{R}^d)$ the
natural map that concatenates $w_-$ with the "path" component of $w_+$, then the image
of $\mathbf{P}$ under $\Pi$ is precisely the law of a two-sided fractional Brownian motion with
Hurst parameter $H$. Similarly, we have a natural shift map $\Theta : \mathcal{W} \to \mathcal{W}_-$ that consists
of composing $\Pi$ with the usual time-1 shift map that maps $\mathcal{C}((-\infty, 1], \mathbb{R}^d)$ into
$\mathcal{C}((-\infty, 0], \mathbb{R}^d) \supset \mathcal{W}_-$. It follows from the definitions of $\mathcal{W}_-$ and $\mathcal{W}_+$ that the map
$\Theta$ is actually continuous. This construction allows us to lift $\mathcal{P}$ to a Feller Markov
transition kernel $\hat{\mathcal{P}}$ on $\mathcal{W}$ by

$$\hat{\mathcal{P}}(w, \cdot) = \delta_{\Theta(w)} \otimes \mathcal{P}(\Theta(w), \cdot).$$

It also follows from our construction that $\mathbf{P}$ is invariant (and ergodic) for $\hat{\mathcal{P}}$. Indeed, the
action of $\hat{\mathcal{P}}(w, \cdot)$ is to shift the "path" component of $w$ backwards by one time unit and
to then concatenate it with the canonical lift to $\mathcal{W}_+$ of a piece of two-sided fractional
Brownian motion, conditional on its past being given by $\Theta(w)$.

We now combine the noise process $(\mathcal{W}, \mathbf{P}, \hat{\mathcal{P}})$ with the solution map for (1.1) in
the following way. As before let $\Phi_\cdot(z, w_+)$ denote the map that solves (1.1) for a given
initial condition $z$ and a given realization $w = (w_-, w_+)$ of the driving noise. Since the
Itô map is continuous on the space of rough paths with fixed Hölder regularity [FV10b],
the map $\Phi$ is continuous. We can then view the solutions to (1.1) as a Markov process
on $\mathbb{R}^n \times \mathcal{W}$ with transition probabilities given by

$$\mathcal{Q}(z, w; \cdot) = \hat\Phi_z^* \hat{\mathcal{P}}(w, \cdot),$$

where we define $\hat\Phi_z : \mathcal{W} \to \mathbb{R}^n \times \mathcal{W}$ by $\hat\Phi_z(w) = (\Phi_1(z, w_+), w)$. In other words,
we first shift back the noise by a time interval 1, then draw a sample from the
conditional realisation of an enhanced fractional Brownian motion on $[0, 1]$, and then use
this sample to solve (1.1) between 0 and 1.

The aim of this section is to show that the Markov operator $\mathcal{Q}$ admits a unique invariant
measure, modulo a natural equivalence relation described in Section 6.1 below.
Note that while $\mathcal{Q}$ is Feller (since $\Phi$ is continuous and the transition kernel of the noise
is Feller), it is certainly not strong Feller in the usual sense. We will however show in
Section 6.1 that there is a natural generalisation of the strong Feller property in this
context that, in a way, only considers the part of $\mathcal{Q}$ in $\mathbb{R}^n$. It turns out that the invertibility
of the Malliavin matrix shown in the preceding sections allows us to prove that $\mathcal{Q}$ satisfies
the strong Feller property in this generalised sense. Combined with a form of topological
irreducibility and a "quasi-Markov" property, this is then sufficient to deduce the
uniqueness of the invariant measure for $\mathcal{Q}$ modulo equivalence of the induced laws on
the space of trajectories on $\mathbb{R}^n$.

6.1 General uniqueness criterion for the invariant measure 

From now on, we use the notation $\mathcal{X} = \mathbb{R}^n$ in order to simplify notation and to
emphasise the fact that the results do not depend on the linear structure of the space.

The aim of this section is to study the uniqueness of "invariant measures" for (1.1). 
The question of uniqueness of the invariant measure for the SDE (1.1) should not be 
interpreted as the question of uniqueness of the invariant measure for the Markov op- 
erator Q constructed in the previous section. This is because one might imagine that 
the augmented phase space X x W contains some "redundant" randomness that is not 
necessary to describe the stationary solutions to (1.1). (This would be the case for
example if the $V_i$'s are not always linearly independent.) One would therefore like to have
a concept of uniqueness for the invariant measure that is independent of the particular
description of the driving noise.

To this end, we introduce the Markov transition kernel $\bar{\mathcal{Q}}$ from $\mathcal{X} \times \mathcal{W}$ to $\mathcal{X}^{\mathbb{N}}$
constructed in the following way. Denote by $(z_n, w_n)$ a sample of the Markov chain
with transition probabilities $\mathcal{Q}$ starting at $(z_0, w_0)$. We then denote by $\bar{\mathcal{Q}}(z_0, w_0; \cdot)$ the
law of $(z_1, z_2, \ldots)$. (We do not include the starting point, consistent with the convention
that $0 \notin \mathbb{N}$.)

With this notation, we have a natural equivalence relation between measures on
$\mathcal{X} \times \mathcal{W}$ given by

$$\mu \sim \nu \quad \Longleftrightarrow \quad \bar{\mathcal{Q}}\mu = \bar{\mathcal{Q}}\nu. \tag{6.2}$$

In other words, two measures on $\mathcal{X} \times \mathcal{W}$ are equivalent if they generate the same
dynamics in $\mathcal{X}$. In the particular case when the process in $\mathcal{X}$ is Markov, $\bar{\mathcal{Q}}$ is independent
of $w$ and the equivalence relation simply states that the marginals on $\mathcal{X}$ should agree.
Denoting by $\|\cdot\|_{TV}$ the total variation norm, this suggests that the following is a good
generalisation of the strong Feller property to our setting:

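Definition 6.1 The solutions to (1.1) are said to be strong Feller if there exists a jointly
continuous function $\ell : \mathcal{X}^2 \times \mathcal{W} \to \mathbb{R}_+$ such that

$$\|\bar{\mathcal{Q}}(z, w; \cdot) - \bar{\mathcal{Q}}(y, w; \cdot)\|_{TV} \le \ell(z, y, w), \tag{6.3}$$

and $\ell(z, z, w) = 0$ for every $z \in \mathcal{X}$ and every $w \in \mathcal{W}$.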

We stress again that the definition given here has essentially nothing to do with the 
strong Feller property of Q. It rather generalises the notion of the strong Feller property 
for the Markov process associated to (1.1) in the case where the driving noise is white 
in time. See for example the review article [Hai09] for more details. 

Definition 6.2 The solutions to (1.1) are said to be topologically irreducible if, for
every $z \in \mathcal{X}$, $w \in \mathcal{W}$ and every non-empty open set $U \subset \mathcal{X}$, one has
$\mathcal{Q}(z, w; U \times \mathcal{W}) > 0$.

Remark 6.3 In order to prove topological irreducibility, one usually uses some form of
the Stroock-Varadhan support theorem [SV72]. A version of this theorem was shown
to hold in the present context in [FV10b, Thm 15.63]. This shows that, in order to
verify that (1.1) is topologically irreducible, it suffices to show that, for every $x_0 \in \mathbb{R}^n$,
the set of points that are obtained as the solution at time $t = 1$ to

$$\dot x(t) = V_0(x(t)) + \sum_{i=1}^d V_i(x(t))\, u_i(t), \qquad x(0) = x_0,$$

with $u \in \mathcal{C}^\infty([0, 1], \mathbb{R}^d)$ is dense in $\mathbb{R}^n$.

The following result, which is a consequence of [HO07, Thm 3.10], is a generalisa- 
tion of the well-known Doeblin-Doob-Khasminskii criterion for the uniqueness of the 
invariant measure of a general Markov chain: 

Theorem 6.4 If the solutions to (1.1) are strong Feller and topologically irreducible,
then (1.1) can have at most one invariant measure, modulo the equivalence
relation (6.2).

Proof. The only missing ingredient to be able to apply [HO07, Thm 3.10] is the
"quasi-Markov" property of the solutions to a stochastic differential equation driven by
fractional Brownian motion with $H \in (\frac13, \frac12)$. For the case $H > \frac12$, this property was shown
to hold in [HO07, Prop. 5.11]. The proof in the case $H < \frac12$ is virtually identical, so we
only sketch it. It only uses the fact that the set X = {h £ C°°([0, 1], R d ) : ft'(0) = 0} 
has the following properties: 






1. The canonical injection X W+ has dense image in W+. 

2. The set X belongs to the Cameron-Martin space of P + , viewed as a measure on 
C([0,l],R d ). 

3. The set {Qh : h G X}, where Q is defined as 



belongs to the Cameron-Martin space of P + , viewed as a measure on C(K+, R d ). 
Indeed, because of the representation (4.2) and the properties of fractional inte- 
grals, it suffices to check that V H+ ^gh G L 2 (K + ,M <i ). for h G X. Using 
(4.9), an explicit calculation shows that V H+ ^Qh ~ t^~ H for t <C 1 and 
V H+ ?gh ~ t-?- H for t > 1 (see also [Hai05, Lemma 4.3]), so that this is 
indeed the case. 

Indeed, the first property ensures that, given any two open sets U, V G W+, we can 
find two smaller open sets U, V € W+, such that U C U, V C V, and V" = r^i?, with 
h <E X. Furthermore, P + (U) > 0, and so is F + (V) since the topological support of P + 
is all of W+ by [FVlOb, Thm 15.63]. 

Since X belongs to the Cameron-Martin space of P + , this guarantees that, for ev- 
ery w_ G W_, we can construct a subcoupling Vjj,v on Wl between V(w-, and 
P(w-, -)\y such that Vjjy charges the set of pairs (w + ,w + ) such that w + = tuw + . 
In order to check the quasi-Markov property, it now suffices to check that the measures 
Q(x, w; •) and Q(x, w; •) are mutually equivalent if (w, w) are such that their compo- 
nents in W_ are identical and their components in W+ satisfy w + = ThW + . This in 
turn is precisely the content of the third property above. □ 

The aim of the next section is to show that (1.1) does indeed possess the strong 
Feller property, provided that the vector fields {Vi} satisfy Hormander's bracket condi- 
tion. 

6.2 Verification of the strong Feller property 

The main result of this section is that the strong Feller property is a consequence of 
Hormander's bracket condition. 

Theorem 6.5 Under Assumptions 4.1 and 5.1, (1.1) is strong Feller in the sense of 
Definition 6.1. 

Remark 6.6 The main feature that distinguishes this situation from the usual one is 
that the process is not Markov. As a consequence, our definition of the strong Feller 
property implies that, as in [HO07], we need to construct a coupling between solutions 
starting from nearby points such that, with high probability, solutions agree not only 
after some fixed time (say 1), but also for all subsequent times. Furthermore, we will 
circumvent the fact that we do not assume a priori that the Jacobian of our solution has 
moments. This will be done by a cutoff procedure similar to [HO07]. 

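Proof. Fix some arbitrary value $N \ge 1$ and a Fréchet differentiable map $\psi : \mathcal{X}^N \to \mathbb{R}$,
which is bounded with bounded derivative. Denote furthermore by $R_N : \mathcal{X}^{\mathbb{N}} \to \mathcal{X}^N$
the projection onto the first $N$ components, and set as before

$$\mathcal{Q}\psi(z, w) = \int_{\mathcal{X}^{\mathbb{N}}} \psi(R_N x)\, \bar{\mathcal{Q}}(z, w; dx),$$

so that $\mathcal{Q}\psi : \mathcal{X} \times \mathcal{W} \to \mathbb{R}$.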

The strong Feller property will follow if we show the existence of a jointly continuous
function $\ell : \mathcal{X}^2 \times \mathcal{W} \to \mathbb{R}_+$ such that

$$|\mathcal{Q}\psi(z, w) - \mathcal{Q}\psi(y, w)| \le \ell(z, y, w), \tag{6.4}$$

for all Fréchet differentiable functions $\psi$ with bounded derivatives such that

$$\sup_x |\psi(x)| \le 1,$$

uniformly for all $N \ge 1$.

To this end, set $z_s = zs + y(1 - s)$ for $s \in [0, 1]$ and $\xi = z - y$. Let $\Phi_{[1,T]}(z, w_+)$
denote the solution to (1.1), restricted to the interval $[1, T]$. Since

$$\mathcal{Q}\psi(z, w) = \mathbf{E}\, \psi_T(\Phi_{[1,T]}(z, w_+)),$$

where $\psi_T$ is just $\psi$, composed with the evaluation map at integer times, we have the
identity

$$\mathcal{Q}\psi(z, w) - \mathcal{Q}\psi(y, w) = \mathbf{E} \int_0^1 D\psi_T(\Phi_{[1,T]}(z_s, w_+))\, J^{z_s}_{0,\cdot}\, \xi\, ds. \tag{6.5}$$

Here, $J^{z_s}_{0,\cdot}$ denotes the linearization of (1.1) with the initial condition $z_s$.

If moment bounds for the Jacobian $J_{0,t}$ are available, as in [HP11], we can proceed
via a stochastic control argument using a Bismut-Elworthy-Li type formula [EL94] to
show that $\mathcal{Q}\psi(z, w)$ is actually differentiable in $z$. Since we do not assume this,
we will combine this with a cutoff argument adapted from [HO07].

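Recall the function $A_{\beta,q}(X, \mathbb{X}) = A_{\beta,q,1}(X, \mathbb{X})$ from (5.19) with $\beta > \gamma$ and set $q$
to be an even integer such that (5.20) holds. Similar to (5.21), define the cutoff function

$$\Psi_R(w_+) = \chi\big(R^{-1} A_{\beta,q}(X(w_+), \mathbb{X}(w_+))\big), \qquad R > 0, \tag{6.6}$$

where $\chi : \mathbb{R}_+ \to \mathbb{R}_+$ is a $C^\infty$ non-increasing function with $\chi(\lambda) = 1$ for $\lambda \le 1$ and
$\chi(\lambda) = 0$ for $\lambda \ge 2$.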
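
A cutoff function with the properties required of $\chi$ is easy to write down explicitly. The Python sketch below uses the classical construction based on $x \mapsto e^{-1/x}$; this particular choice, as well as the names `chi` and `psi`, are assumptions made for illustration, since the argument only uses that $\chi$ is smooth, non-increasing, equal to 1 on $[0, 1]$ and equal to 0 on $[2, \infty)$.

```python
import math


def _bump(x):
    """exp(-1/x) for x > 0 and 0 otherwise; smooth on the whole real line."""
    return math.exp(-1.0 / x) if x > 0.0 else 0.0


def chi(lam):
    """Smooth non-increasing cutoff: 1 for lam <= 1 and 0 for lam >= 2."""
    a = _bump(2.0 - lam)  # vanishes for lam >= 2
    b = _bump(lam - 1.0)  # vanishes for lam <= 1
    # a + b > 0 for every lam, since 2 - lam > 0 or lam - 1 > 0 always holds.
    return a / (a + b)


def psi(radius, a_value):
    """Cutoff of the form chi(A / R), where a_value stands for A(X, XX)."""
    return chi(a_value / radius)
```

With this choice, `psi(R, a)` equals 1 as long as `a <= R` and vanishes once `a >= 2 * R`, which is the localisation that the cutoff is meant to provide.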

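From (6.5) we obtain that

$$\begin{aligned}
|\mathcal{Q}\psi(z, w) - \mathcal{Q}\psi(y, w)| \le{}& \Big|\mathbf{E} \int_0^1 \Psi_R(w_+)\, D\psi_T(\Phi_{[1,T]}(z_s, w_+))\, J_{0,\cdot}\, \xi\, ds\Big| \\
&+ \big|\mathbf{E}\, (1 - \Psi_R(w_+))\, \psi_T(\Phi_{[1,T]}(z, w_+))\big| \\
&+ \big|\mathbf{E}\, (1 - \Psi_R(w_+))\, \psi_T(\Phi_{[1,T]}(y, w_+))\big| \\
={}& T_1 + T_2 + T_3.
\end{aligned}$$

Since $\psi_T$ is bounded by 1 and $1 - \Psi_R$ vanishes on the set where $A_{\beta,q} \le R$, we have the bound

$$T_2 + T_3 \le 2\, \mathcal{P}\big(w, \{w_+ : A_{\beta,q}(X(w_+), \mathbb{X}(w_+)) > R\}\big), \tag{6.7}$$

which can be made arbitrarily small by choosing $R$ sufficiently large.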

To tackle the term $T_1$, we now outline the stochastic control argument. Recall
the operator $A_T$ from (4.12). As explained in (4.17), the Fréchet derivative of the flow
map with respect to the driving noise $w_+$ in the direction of $\int_0^\cdot v(s)\, ds$ is given by

$$J_{0,T} A_T v.$$

The key idea underlying Bismut-type formulae is to use the relation (4.17) to convert
the derivative of $\Phi_t(z_s, w_+)$ with respect to its initial condition $z_s$ into a derivative with
respect to the driving noise and to use the integration by parts formula from Malliavin
calculus.

To this end, given an initial displacement $\xi \in \mathbb{R}^n$, we seek a "control" $v$ on
the time interval $[0, 1]$ that solves the equation $A_1 v = \xi$. If this can be achieved, then
we extend $v$ to all of $\mathbb{R}_+$ by setting $v(s) = 0$ for $s > 1$ and define $\tilde v = \mathcal{I}^{\frac12 - H} v$.
Note that since $v(s) = 0$ for $s > 1$, it follows from the definition of $A_T$ that we have
$A_T v = A_1 v = \xi$ for $T \ge 1$. If $v$ is sufficiently regular in time so that $\tilde v \in L^2(\mathbb{R}_+, \mathbb{R}^d)$,
we have the identity

$$\mathcal{D}_{\tilde v} Z_T^{z_s} = J_{0,T}\, \xi, \tag{6.8}$$

for every $T \ge 1$ and therefore, by the chain rule,

$$D\psi_T(\Phi_{[1,T]})\, J_{0,\cdot}\, \xi = D\psi_T(\Phi_{[1,T]})\, \mathcal{D}_{\tilde v} Z_\cdot^{z_s} = \mathcal{D}_{\tilde v}\big(\psi_T(\Phi_{[1,T]})\big). \tag{6.9}$$

It remains to find a control $v$ which solves $A_1 v = \xi$. From Proposition 5.4, $C_1 =
A_1 A_1^*$ is invertible and therefore one possible solution to the equation $A_1 v = \xi$ is given
by the "least squares" formula:

$$v(r) = \big(A_1^* (A_1 A_1^*)^{-1} \xi\big)(r) = V(Z_r)^* (J_{0,r}^{-1})^* C_1^{-1} \xi, \qquad r \in [0, 1]. \tag{6.10}$$

Note that the control $v$ also depends on the initial condition $z_s$ but since $|z_s| \le |z| \vee |y|$,
all our estimates for the rest of the proof will be uniform in the initial condition by
Remark 4.2.
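
The "least squares" formula has a transparent finite-dimensional analogue, sketched below in Python with exact rational arithmetic. Here a matrix `a` with full row rank plays the role of the operator $A_1$ and its transpose plays the role of $A_1^*$; the helper names are hypothetical and the example is only meant to illustrate why $v = A^*(AA^*)^{-1}\xi$ solves $Av = \xi$, and that it does so with minimal Euclidean norm.

```python
from fractions import Fraction


def transpose(m):
    return [list(col) for col in zip(*m)]


def mat_vec(m, v):
    return [sum(Fraction(a) * b for a, b in zip(row, v)) for row in m]


def mat_mul(a, b):
    bt = transpose(b)
    return [[sum(Fraction(x) * y for x, y in zip(row, col)) for col in bt]
            for row in a]


def solve(matrix, rhs):
    """Solve a square linear system by Gauss-Jordan elimination over the rationals."""
    n = len(matrix)
    aug = [[Fraction(x) for x in row] + [Fraction(b)]
           for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = next((i for i in range(col, n) if aug[i][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for i in range(n):
            if i != col and aug[i][col] != 0:
                f = aug[i][col]
                aug[i] = [x - f * y for x, y in zip(aug[i], aug[col])]
    return [row[n] for row in aug]


def least_squares_control(a, xi):
    """Return v = A^T (A A^T)^{-1} xi, the minimal-norm solution of A v = xi."""
    at = transpose(a)
    gram = mat_mul(a, at)  # finite-dimensional counterpart of C_1 = A_1 A_1^*
    return mat_vec(at, solve(gram, xi))
```

For instance, with `a = [[1, 1, 0], [0, 1, 1]]` and `xi = [1, 2]`, the function returns `[0, 1, 1]`, whereas `[1, 0, 2]` is another solution of the same equation with a larger norm.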

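Inserting the identity (6.9) into the definition of $T_1$, we obtain

$$|T_1| = \Big|\int_0^1 \mathbf{E}\big(\Psi_R(w_+)\, \mathcal{D}_{\tilde v} \psi_T(\Phi_{[1,T]}(z_s, w_+))\big)\, ds\Big|. \tag{6.11}$$

Applying the integration by parts formula from Malliavin calculus, we obtain

$$\mathbf{E}\big(\Psi_R(w_+)\, \mathcal{D}_{\tilde v} \psi_T(\Phi_{[1,T]})\big) = \mathbf{E}\big(\psi_T(\Phi_{[1,T]})\, \mathcal{D}^*(\Psi_R(w_+)\, \tilde v)\big) \le \big(\mathbf{E}|\mathcal{D}^*(\Psi_R(w_+)\, \tilde v)|^2\big)^{1/2}, \tag{6.12}$$

where the inequality follows from the fact that $\psi_T$ is bounded by 1. To conclude
the proof, it thus suffices to show that

$$\mathbf{E}\big(|\mathcal{D}^*(\Psi_R \tilde v)|^2\big) \le C(R, w_-, z)\, |\xi|^2, \tag{6.13}$$

where $C$ is uniformly bounded on $|||w_-|||_\gamma < M$ and $|z| < M$.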

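Since the stochastic process $\tilde v$ is in general not adapted to the filtration generated
by the underlying Wiener process, we use the following extension of Itô's isometry
[Nua06]:

$$\begin{aligned}
\mathbf{E}\big(|\mathcal{D}^*(\Psi_R \tilde v)|^2\big) &= \mathbf{E}\Big(\int_0^\infty |\Psi_R \tilde v(s)|^2\, ds\Big) + \mathbf{E} \int_0^\infty\!\!\int_0^\infty \mathrm{tr}\big(\mathcal{D}_t(\Psi_R \tilde v(s))^T\, \mathcal{D}_s(\Psi_R \tilde v(t))\big)\, ds\, dt \\
&\le \mathbf{E}\|\Psi_R \tilde v\|^2 + \mathbf{E}\|\mathcal{D}(\Psi_R \tilde v)\|^2 \le c\big(\mathbf{E}\|\Psi_R v\|^2 + \mathbf{E}\|\mathcal{D}(\Psi_R v)\|^2\big) \\
&= I_1 + I_2.
\end{aligned} \tag{6.14}$$

Here, $\|v\|$ denotes the $L^2$-norm of $v$ and similarly for $\mathcal{D}v$. Since $\tilde v = \mathcal{I}^{\frac12 - H} v$, the
second inequality is a consequence of the fact that $\mathcal{I}^{\frac12 - H}$ is a bounded operator from
$L^2([0, 1])$ into $L^2(\mathbb{R}_+)$, see Corollary 6.8 below.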

The bound (6.13) on $I_1$ now follows immediately from Proposition 5.4 and Remark 4.2.
The bound on $I_2$ follows similarly by also using Theorem 4.5. The proof of
Theorem 6.5 is complete. □

We now show that $\mathcal{I}^{\frac12 - H}$ is indeed a bounded operator from $L^2([0, 1])$ to $L^2(\mathbb{R}_+)$.
For this, define the operator $\tilde{\mathcal{I}}^\alpha$ by

$$(\tilde{\mathcal{I}}^\alpha v)(s) = \int_0^1 |s - r|^{\alpha - 1}\, v(r)\, dr.$$

We then have: 

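Lemma 6.7 For $\alpha \in (0, \frac12)$, there exists a constant $c$ such that, for positive $v$,

$$\|\tilde{\mathcal{I}}^\alpha v\|^2 \le c\, \|v\|\, \|\tilde{\mathcal{I}}^{2\alpha} v\|.$$

Proof. We have the bound

$$\begin{aligned}
\|\tilde{\mathcal{I}}^\alpha v\|^2 &= \int_0^1\!\!\int_0^1\!\!\int_0^\infty |s - r|^{\alpha - 1} |s - t|^{\alpha - 1}\, ds\, v(r) v(t)\, dr\, dt \\
&\le \int_0^1\!\!\int_0^1\!\!\int_{-\infty}^\infty |s - r|^{\alpha - 1} |s - t|^{\alpha - 1}\, ds\, v(r) v(t)\, dr\, dt \\
&= c \int_0^1\!\!\int_0^1 |r - t|^{2\alpha - 1}\, v(r) v(t)\, dr\, dt,
\end{aligned}$$

where the first step follows from the positivity of $v$ and the second step follows from
a simple scaling argument. Since this is nothing but $c \langle v, \tilde{\mathcal{I}}^{2\alpha} v\rangle$, the requested bound
follows from the Cauchy-Schwarz inequality. □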

Corollary 6.8 For every a £ (0, 1), the operator I a is bounded from L 2 ([0, 1]) to 
L 2 < 



Proof. Note that 

\l a v(s)\ <l a \v\(s) < l a \v\(s) . 

Since \s — r| Q_1 is square integrable if a > |, the claim follows for that range of a. 
For smaller values of a, it is always possible to reduce oneself to the range (i, 1) by 
Lemma 6.7, noting also that \\l a v\\ < c\\Tpv\\ for a > 0. □ 



7 Examples 

In this section, we collect a few examples to which our main results apply.

7.1 Hypoelliptic Ornstein-Uhlenbeck process

Consider the process $x_t$ given by

$$dx = Ax\, dt + C\, dB_H(t), \tag{7.1}$$

where $x \in \mathbb{R}^n$, $B_H$ is an $m$-dimensional fractional Brownian motion with Hurst parameter
$H > \frac13$, $A$ is an $n \times n$ matrix with $A + A^T < 0$, and $C$ is an $n \times m$ matrix. It is
well-known that (7.1) satisfies Hörmander's condition if and only if there exists $k > 0$
such that the matrix $(C, AC, \ldots, A^k C)$ has rank $n$.
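
The rank condition above is straightforward to check numerically. The following Python sketch does so in exact rational arithmetic; by the Cayley-Hamilton theorem it suffices to take $k = n - 1$. The function names and the test matrices are illustrative assumptions and do not come from the text.

```python
from fractions import Fraction


def mat_mul(a, b):
    cols = list(zip(*b))
    return [[sum(Fraction(x) * y for x, y in zip(row, col)) for col in cols]
            for row in a]


def rank(rows):
    """Rank of a matrix, computed by Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = 0
    n_cols = len(m[0]) if m else 0
    for col in range(n_cols):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][col] / m[r][col]
            m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
    return r


def kalman_matrix(a, c):
    """Return the block matrix (C, AC, ..., A^(n-1) C) as a list of rows."""
    n = len(a)
    block = [[Fraction(x) for x in row] for row in c]
    blocks = [block]
    for _ in range(n - 1):
        block = mat_mul(a, block)
        blocks.append(block)
    return [sum((blk[i] for blk in blocks), []) for i in range(n)]


def kalman_rank(a, c):
    return rank(kalman_matrix(a, c))


def satisfies_kalman_condition(a, c):
    """True if (C, AC, ..., A^(n-1) C) has full rank n."""
    return kalman_rank(a, c) == len(a)
```

For example, with `a = [[-1, 1], [0, -1]]`, noise acting only on the second coordinate (`c = [[0], [1]]`) satisfies the condition, while noise acting only on the first coordinate (`c = [[1], [0]]`) does not, because the first coordinate does not feed into the second.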

Since the Jacobian is given by $J_{s,t} = \exp(A(t - s))$ and therefore has moments
of all orders, we conclude that, for any initial condition $x$ and for any sequence of
times $t_1, \ldots, t_k$, the joint distribution of $(x_{t_1}, \ldots, x_{t_k})$ has a smooth density with
respect to Lebesgue measure. Since this distribution is Gaussian, one could have verified
directly that its covariance is non-degenerate, but this would have been a rather lengthy
calculation.

7.2 Linear equations / Levy area 

Let $B$ be a $d$-dimensional fractional Brownian motion and consider equations in $\mathbb{R}^n$
of the type

$$dX_i = (A_{ijk} X_j + C_{ik}) \circ dB_k(t), \tag{7.2}$$

where we use Einstein's convention of summation over repeated indices. In this case,
the derivative of the solution with respect to its initial condition in a direction $\eta \in \mathbb{R}^n$
is nothing but the solution to

$$dJ_i = A_{ijk} J_j \circ dB_k(t), \tag{7.3}$$

with initial condition $J(0) = \eta$. Similar formulae hold for higher order derivatives,
so that it follows from the results recently obtained in [FR11] that Assumption 5.7
is satisfied and our result on the smoothness of the densities applies, provided that
Hörmander's condition holds.

As an immediate consequence, we have the smoothness of the Lévy area, which
was recently obtained independently in [Dri10]:

Proposition 7.1 Let $B$ be a $d$-dimensional fractional Brownian motion with Hurst parameter
$H > \frac13$ and let $W_{ij}(t) = \int_0^t B_i(s) \circ dB_j(s) - \int_0^t B_j(s) \circ dB_i(s)$ for $i < j$.
Then, for any fixed $t > 0$, the vector $(B_k(t), W_{ij}(t))$ with $k = 1, \ldots, d$ and $i < j$ has a
smooth density with respect to Lebesgue measure.

Proof. The verification of Hörmander's condition boils down to a simple problem in
linear algebra. Writing $e_j$ for the basis vector in the direction $B_j$ and $f_{ij}$ for the basis
vector in the direction $W_{ij}$, we can rewrite $x = B \oplus W$ as the solution to the SDE

$$dx = \sum_j \Big(e_j + \sum_{i<j} f_{ij} \langle x, e_i\rangle - \sum_{i>j} f_{ji} \langle x, e_i\rangle\Big) \circ dB_j = \sum_j V_j(x) \circ dB_j.$$

An explicit calculation shows that, for $j < k$, we have

$$[V_j, V_k](x) = 2 f_{jk},$$

so that Hörmander's condition holds after one step. □
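
The bracket computation in this proof can be checked mechanically. The Python sketch below encodes the vector fields $V_j(x) = e_j + \sum_{i<j} f_{ij}\langle x, e_i\rangle - \sum_{i>j} f_{ji}\langle x, e_i\rangle$ as affine maps and evaluates $[V_j, V_k] = DV_k\, V_j - DV_j\, V_k$. The sign convention for the bracket, the ordering of the basis vectors $f_{ij}$, and all function names are assumptions made for this illustration.

```python
from itertools import combinations


def levy_area_fields(d):
    """Build V_j(x) = const_j + jac_j x for the pair (B, W) in dimension d."""
    pairs = list(combinations(range(d), 2))
    dim = d + len(pairs)
    # pos[(i, j)] is the coordinate carrying the basis vector f_ij, i < j.
    pos = {pair: d + idx for idx, pair in enumerate(pairs)}
    consts, jacs = [], []
    for j in range(d):
        const = [0] * dim
        const[j] = 1  # the e_j component
        jac = [[0] * dim for _ in range(dim)]
        for i in range(d):
            if i < j:
                jac[pos[(i, j)]][i] = 1   # + f_ij <x, e_i>
            elif i > j:
                jac[pos[(j, i)]][i] = -1  # - f_ji <x, e_i>
        consts.append(const)
        jacs.append(jac)
    return consts, jacs, pos


def field(const, jac, x):
    return [c + sum(row[m] * x[m] for m in range(len(x)))
            for c, row in zip(const, jac)]


def lie_bracket(j, k, x, consts, jacs):
    """[V_j, V_k](x) = DV_k(x) V_j(x) - DV_j(x) V_k(x) for affine fields."""
    vj = field(consts[j], jacs[j], x)
    vk = field(consts[k], jacs[k], x)
    dim = len(x)
    return [
        sum(jacs[k][r][m] * vj[m] for m in range(dim))
        - sum(jacs[j][r][m] * vk[m] for m in range(dim))
        for r in range(dim)
    ]
```

Since the fields are affine, the brackets do not depend on the base point, and for every pair $j < k$ the result is twice the basis vector $f_{jk}$, so that the $V_j$ together with their first-order brackets span the whole space.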

Remark 7.2 Higher order totally antisymmetric iterated integrals can be treated in
exactly the same way, with the $k$th iterated Lie brackets recovering precisely the basis
vectors of the elements in the $k$th antisymmetric tensor.






7.3 Simplified fractional Langevin equation 

Consider the process $(q_t, p_t)$ on $\mathbb{R}^{2n}$ given by

$$dq = p\, dt, \qquad dp = -\nabla V(q)\, dt - p\, dt + dB_H(t), \tag{7.4}$$

where, for the sake of simplicity, we assume that $V : \mathbb{R}^n \to \mathbb{R}_+$ has bounded second
derivative and there exist $C > 0$ and $\kappa > 0$ such that

$$\langle q, \nabla V(q)\rangle \ge \kappa |q|^2 - C, \qquad V(q) \ge \kappa |q|^2 - C. \tag{7.5}$$

This equation is a simplified version of the fractional Langevin equation. (The equation
satisfying the correct physical detailed balance condition would have a more complicated
memory kernel instead of the simple friction term $-p\, dt$ appearing above.)

Because we assume $V$ to have a bounded second derivative, the Jacobian of (7.4)
is bounded by a deterministic constant over any finite time interval. Furthermore,
Hörmander's condition is easy to verify, so that we can apply Theorem 5.11 to infer the
existence of smooth densities for the joint distribution of the solution at any time.

Regarding the existence of a unique invariant measure for (7.4), it only remains to
obtain a Lyapunov function for the solution to (7.4). For this, similarly to [Hai05], we
proceed as follows. We consider the process $(\tilde p, \tilde q)$ solution to

$$d\tilde q = -\tilde q\, dt, \qquad d\tilde p = -\tilde p\, dt + dB_H(t).$$

It is of course trivial to bound solutions to this equation. Then, we set $P = p - \tilde p$ and
$Q = q - \tilde q$. The equation for $(P, Q)$ can be written as

$$\dot Q = P + R_Q, \qquad \dot P = -\nabla V(Q) - P + R_P,$$

where

$$R_Q = \tilde p + \tilde q, \qquad R_P = \nabla V(Q) - \nabla V(Q + \tilde q).$$

Note that since we assumed that $V$ has bounded second derivative, both $R_P$ and $R_Q$ are
bounded by a multiple of $|\tilde p| + |\tilde q|$, independently of $P$ and $Q$. We now set $H(P, Q) =
\frac12 |P|^2 + V(Q) + \gamma \langle P, Q\rangle$ for a constant $\gamma$ to be determined later. An explicit calculation
yields the identity

$$\frac{d}{dt} H(P, Q) = -(1 - \gamma) |P|^2 - \gamma \langle Q, \nabla V(Q)\rangle - \gamma \langle P, Q\rangle + \langle \nabla V(Q) + \gamma P, R_Q\rangle + \langle P + \gamma Q, R_P\rangle.$$

Making use of (7.5) and the bounds on $R_Q$ and $R_P$, we see that there exist constants
$\alpha > 0$ and $C > 0$ such that

$$\frac{d}{dt} H(P, Q) \le -\alpha H(P, Q) + C(1 + |\tilde p|^2 + |\tilde q|^2).$$

Since, by (7.5), $H$ grows quadratically at infinity for $\gamma$ small enough, it follows in the
same way as in [Hai05, Prop. 3.12] that $|p|^2 + |q|^2$ is a Lyapunov function for (7.4).
We therefore have:
We therefore have: 

Theorem 7.3 If $V$ has bounded second derivative and (7.5) holds, then there exists a
unique invariant measure for (7.4).

Proof. The existence of an invariant measure follows from the fact that $|p|^2 + |q|^2$ is a
Lyapunov function. The uniqueness then follows from Theorem 6.4. □

Appendix A Bounds on the cutoff function

In this section, we show that the function $A_{\beta,q}$ appearing in Sections 5.2 and 6.2 does
indeed have the requested smoothness properties. Our main result is the following:

Proposition A.1 Let $A_{\beta,q}$ be as in (5.19) and assume that $\beta$ and $q$ are such that (5.20)
holds and such that $2\beta \le \gamma + H$. (This is always possible by first setting $\beta = (\gamma + H)/2$
and then choosing $q$ large enough.)

Then, for every $k > 0$ and every $R > 0$, there exists a constant $M$ such that the
bound

$$\|\mathcal{D}^{(k)} A_{\beta,q}(X, \mathbb{X})\| \le M$$

holds for all $(X, \mathbb{X})$ such that $A_{\beta,q}(X, \mathbb{X}) \le 2R$. Here, $\mathcal{D}^{(k)}$ denotes the $k$th iterated
Malliavin derivative and $\|\cdot\|$ is the $L^2$-norm on $[0, T]^k$.

Proof. Note first that, by definition 

where Sij is the Kronecker delta. It thus follows from (4.21) that 

BUKt = C M(* - r) H ~h r<t - (s - r) H -h r<s ) * Sijf a , t (r) . (A.l) 
The L 2 -norm of f s t is given by 

\\fs,t\\ 2 = c 2 f\t r) 2 *- 1 dr + c 2 j\(t r) 2 "- 1 - (s - r) 2 "" 1 ) dr . 

Since H < \ by assumption, one has the inequality 

\t 2H - S 2H \ < \t-s\ 2H , 
so that a straightforward calculation yields the bound 

\\f.A<*\t-*\ H . 

for some constant n > 0. 

Concerning X^ t = X^ t — X lk t , an explicit calculation yields the identity 

D*X£ = l re[a , t] (S ik (6X* it - SXi r ) S U (SX k t SX k r )) . 
Applying again (4.21), we obtain 

Djxjf t = S ik G e s<t (r) + SuG k !t (r) , 

where we set 

rt 

G k Jr) = 21 r£[M] / (u - r) H -Ux k u du 

J r 

/oo 
(u-r) H -Uu 

+ h-< s f\u - r) H -i(5Xl u - 5Xt t ) du . 

J s

The important fact about $G^k_{s,t}(r)$ is that we can estimate it by

|G* t (r)| <M\\X\\ 1 l reys ^t-rV +H -^ 

+ M||X|| 7 l r<s (i - S y((t - r) H -i -(s- r) H -i) , 

so that its L 2 -norm is controlled by 

\\G k Stt \\<M\\X\\,\t-s\ H ^ . (A.2) 

We finally compute the second Malliavin derivative of Xj £ t . It follows in a rather 
straightforward way from (4.21) that, for r\ e (s, t) and d ? (n, t), one has 

D^D^X^ = (SikSjt + SuSjk) (2 J\u - ri ) H -i du + J™(u - ri ) H - 'i du) , 

for n < s and v G (s, t), one has 

D^D^X^ = (J ifc fy + SuSjk) (2 ^*(« - n)*"* rfu - jf *(u - n)""* du) , 

and for all other combinations with v > n one has D^D^X^ = 0. 
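A lengthy but straightforward calculation then yields

\[ \mathscr{D}^j_{r_2}\mathscr{D}^i_{r_1}\tilde{\mathbb X}^{k\ell}_{s,t} = (\delta_{ik}\delta_{j\ell} + \delta_{i\ell}\delta_{jk})\,g_{s,t}(r_1,r_2) , \tag{A.3} \]

where, for $s < r_1 < r_2 < t$, the function $g_{s,t}$ is given by

\begin{align*}
g_{s,t}(r_1,r_2) &= 2\int_{r_2}^t (v-r_2)^{H-\frac32}\int_{r_2}^v (u-r_1)^{H-\frac32}\,du\,dv \\
&\quad + c_g\,(t-r_2)^{H-\frac12}\bigl(2(r_2-r_1)^{H-\frac12} - (t-r_1)^{H-\frac12}\bigr) \\
&= g^{(1)}_{s,t}(r_1,r_2) + g^{(2)}_{s,t}(r_1,r_2) ,
\end{align*}

for some constant $c_g$. For $r_1 < s < r_2 < t$ on the other hand, one has

\begin{align*}
g_{s,t}(r_1,r_2) &= 2\int_{r_2}^t (v-r_2)^{H-\frac32}\int_{r_2}^v (u-r_1)^{H-\frac32}\,du\,dv \\
&\quad + c_g\,(t-r_2)^{H-\frac12}\bigl(2(r_2-r_1)^{H-\frac12} - (s-r_1)^{H-\frac12} - (t-r_1)^{H-\frac12}\bigr) \\
&= g^{(3)}_{s,t}(r_1,r_2) + g^{(4)}_{s,t}(r_1,r_2) .
\end{align*}

Finally, for $r_1 < r_2 < s < t$, one has

\[ g_{s,t}(r_1,r_2) = 2\int (v-r_2)^{H-\frac32}\int (u-r_1)^{H-\frac32}\,du\,dv = g^{(5)}_{s,t}(r_1,r_2) . \]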
It is possible to check that

\[ \|g_{s,t}\| \le M|t-s|^{2H} , \tag{A.4} \]

where $\|\cdot\|$ denotes again the $L^2$-norm. (We postpone the proof of this to Lemma A.3 below.)

We now have all the necessary tools to conclude. Write

\[ \Lambda_{\beta,q}(X,\mathbb X) = \Lambda^{(1)}_{\beta,q}(X) + \Lambda^{(2)}_{\beta,q}(\mathbb X) , \]

where $\Lambda^{(1)}_{\beta,q}$ is as in (5.19), but keeping only the term proportional to $|\delta X_{s,t}|^{2q}$ in the integral, and similarly for $\Lambda^{(2)}_{\beta,q}$. It follows from (A.1) that, for $\ell \le 2q$, the multiple Malliavin derivative of $\Lambda^{(1)}_{\beta,q}$ satisfies the bound



|D„ ...D.,A™(X)| < Mf(fi£,n ■ 



Since the $L^2$-norm of $f_{s,t}$ is bounded by $M|t-s|^H$, it follows immediately that there exists a constant $M$ such that the $L^2$-norm of $\mathscr{D}_{s_1}\cdots\mathscr{D}_{s_\ell}\Lambda^{(1)}_{\beta,q}$ is bounded by $M\Lambda^{(1)}_{\beta,q}$. Its Malliavin derivative of order $\ell > 2q$ on the other hand vanishes identically.

Similarly, we only need to consider Malliavin derivatives of order $\ell \le 2q$ for $\Lambda^{(2)}_{\beta,q}$. A reasoning similar to the above shows that its Malliavin derivative can be written as

D Sl ...D,A^ = y o ^ ^-pj dsdt, 

where $P$ is a homogeneous polynomial of degree $q$ and "$s_\cdot$" is a generic placeholder for any of the times $s_1, \dots, s_\ell$. It now follows from (A.2), (A.4), and the assumption $2\beta < \gamma + H$ that the $L^2$-norm of $\mathscr{D}_{s_1}\cdots\mathscr{D}_{s_\ell}\Lambda^{(2)}_{\beta,q}$ is bounded by $M(\Lambda^{(2)}_{\beta,q} + \|X\|^q_\gamma)$. Since on the other hand $\|X\|^q_\gamma$ is bounded by $M\Lambda^{(1)}_{\beta,q}$, by assumption, this concludes the proof. □



Corollary A.2 Let $\Psi_R(w_+)$ be as defined in (6.6) with $\Lambda_{\beta,q}$ as in (5.19), and let $\beta$ and $q$ be as in Proposition A.1.

Then, for every $R > 0$, $\Psi_R \in \mathbb{D}^\infty$. Furthermore, every multiple Malliavin derivative of $\Psi_R$ vanishes outside of the set $\{\Lambda_{\beta,q}(X,\mathbb X) \le 2R\}$.

Proof. By the chain rule,

\[ \mathscr{D}_s \Psi_R(w_+) = R^{-1}\chi'\bigl(R^{-1}\Lambda_{\beta,q}(X,\mathbb X)\bigr)\,\mathscr{D}_s\Lambda_{\beta,q}(X,\mathbb X) , \]

and similarly for higher order derivatives. Since all derivatives of $\chi$ vanish when the argument is larger than 2, the claim follows from Proposition A.1. □
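The support statement in the corollary can be illustrated with a toy cutoff. The cutoff function from (6.6) is not reproduced in this section, so `chi` below is a hypothetical stand-in: a standard smooth function equal to 1 on $(-\infty,1]$ and 0 on $[2,\infty)$. The names `psi` and `dpsi_dlam` are likewise illustrative, and the argument `lam` plays the role of the scalar value of the functional appearing inside the cutoff.

```python
import math

def _g(y):
    """Smooth on the real line: exp(-1/y) for y > 0 and 0 otherwise."""
    return math.exp(-1.0 / y) if y > 0 else 0.0

def chi(x):
    """Hypothetical smooth cutoff: equals 1 for x <= 1 and 0 for x >= 2."""
    a, b = _g(2.0 - x), _g(x - 1.0)
    # (2 - x) + (x - 1) = 1, so a and b never vanish simultaneously.
    return a / (a + b)

def psi(lam, R):
    """Cutoff applied to the rescaled argument lam / R."""
    return chi(lam / R)

def dpsi_dlam(lam, R, h=1e-6):
    """Central difference approximating (1 / R) * chi'(lam / R)."""
    return (psi(lam + h, R) - psi(lam - h, R)) / (2 * h)

if __name__ == "__main__":
    R = 3.0
    for lam in (1.0, 4.5, 6.5):
        print(lam, psi(lam, R), dpsi_dlam(lam, R))
```

In this sketch the derivative is identically zero once `lam` exceeds `2 * R`, which mirrors the claim that every derivative of the cutoff functional vanishes outside the set where its argument is at most $2R$.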



Lemma A.3 For every $T > 0$, there exists a constant $M$ such that the function $g_{s,t}$ from (A.3) satisfies $\|g_{s,t}\| \le M|t-s|^{2H}$.

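Proof. We show the bound separately for $g^{(j)}_{s,t}$ with $j = 1, \dots, 5$. For $g^{(1)}_{s,t}$, we use the bound

\[ (u-r_1)^{H-\frac32} \le (u-r_1)^{H-\frac32+\beta}(r_2-r_1)^{-\beta} \le (u-r_2)^{H-\frac32+\beta}(r_2-r_1)^{-\beta} , \]

valid for $u > r_2 > r_1$ and $\beta \in [0, \frac32 - H]$, in order to conclude that, provided that $1 - 2H < \beta$, one has the pointwise bound

\[ |g^{(1)}_{s,t}(r_1,r_2)| \le M(t-r_2)^{2H-1+\beta}(r_2-r_1)^{-\beta} \le M|t-s|^{2H-1+\beta}(r_2-r_1)^{-\beta} . \]

If furthermore $\beta < \frac12$ (which is always possible if $H > \frac14$) the integral of $(r_2-r_1)^{-2\beta}$ over $s < r_1 < r_2 < t$ is proportional to $|t-s|^{2-2\beta}$, thus yielding the required bound.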
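For the reader who wants the exponent bookkeeping spelled out, the following worked computation is a sketch assuming a pointwise bound of the form $|g^{(1)}_{s,t}(r_1,r_2)| \le M|t-s|^{2H-1+\beta}(r_2-r_1)^{-\beta}$ with $1-2H < \beta < \frac12$, where $M$ is a generic constant.

```latex
\begin{align*}
\int_s^t\!\int_s^{r_2} (r_2-r_1)^{-2\beta}\,dr_1\,dr_2
  &= \frac{1}{1-2\beta}\int_s^t (r_2-s)^{1-2\beta}\,dr_2
   = \frac{(t-s)^{2-2\beta}}{(1-2\beta)(2-2\beta)} , \\
\int_s^t\!\int_s^{r_2} |g^{(1)}_{s,t}(r_1,r_2)|^2\,dr_1\,dr_2
  &\le M^2\,|t-s|^{2(2H-1+\beta)}\,\frac{(t-s)^{2-2\beta}}{(1-2\beta)(2-2\beta)}
   = \frac{M^2}{(1-2\beta)(2-2\beta)}\,|t-s|^{4H} .
\end{align*}
```

Taking square roots gives a contribution of order $|t-s|^{2H}$, in agreement with the statement of the lemma.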
Similarly, $g^{(2)}_{s,t}$ satisfies

\[ |g^{(2)}_{s,t}(r_1,r_2)| \le C(t-r_2)^{H-\frac12}(r_2-r_1)^{H-\frac12} , \]

and a straightforward calculation shows that

\[ \int_s^t (t-r_2)^{2H-1}\int_s^{r_2} (r_2-r_1)^{2H-1}\,dr_1\,dr_2 = C|t-s|^{4H} \]

for some constant $C$ as required.
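The double integral above can be evaluated in closed form with the Euler Beta function: the inner integral equals $(r_2-s)^{2H}/(2H)$, which gives $C = B(2H, 2H+1)/(2H)$. The sketch below compares this value with a midpoint-rule approximation. The function names are illustrative, and the substitution $x = (t-r_2)^{2H}$ is a numerical convenience chosen here to remove the endpoint singularity, not something taken from the text.

```python
import math

def beta_fn(a, b):
    """Euler Beta function B(a, b) expressed through the Gamma function."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)

def double_integral(s, t, H, n=20000):
    """Midpoint-rule value of
        int_s^t (t - r2)^(2H-1) int_s^r2 (r2 - r1)^(2H-1) dr1 dr2.
    After integrating out r1 and substituting x = (t - r2)^(2H), the
    integral becomes (2H)^(-2) * int_0^{L^(2H)} (L - x^(1/(2H)))^(2H) dx."""
    L = t - s
    p = 1.0 / (2 * H)
    h = L ** (2 * H) / n
    total = 0.0
    for k in range(n):
        x = (k + 0.5) * h
        total += (L - x ** p) ** (2 * H)
    return total * h / (2 * H) ** 2

def closed_form(s, t, H):
    """C * |t - s|^(4H) with C = B(2H, 2H + 1) / (2H)."""
    return beta_fn(2 * H, 2 * H + 1) / (2 * H) * (t - s) ** (4 * H)

if __name__ == "__main__":
    H = 0.4
    print(double_integral(0.3, 1.1, H), closed_form(0.3, 1.1, H))
```

The two printed numbers should agree to several digits, and doubling $t - s$ should multiply both by $2^{4H}$.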
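For $g^{(3)}_{s,t}$ we have, as for $g^{(1)}_{s,t}$,

\[ |g^{(3)}_{s,t}(r_1,r_2)| \le M(t-r_2)^{2H-1+\beta}(r_2-r_1)^{-\beta} . \]

This time however we choose $\beta \in (\frac12, 1)$, so that

\[ \int_s^t\!\int_0^s |g^{(3)}_{s,t}(r_1,r_2)|^2\,dr_1\,dr_2 \le M\int_s^t (t-r_2)^{4H-2+2\beta}(r_2-s)^{1-2\beta}\,dr_2 , \]

which is indeed proportional to $|t-s|^{4H}$.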
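The proportionality claim can be made explicit with the substitution $r_2 = s + (t-s)x$. This is a sketch assuming $\beta \in (\frac12, 1)$ and $H > \frac14$, so that both exponents in the Beta integral exceed $-1$.

```latex
\begin{align*}
\int_s^t (t-r_2)^{4H-2+2\beta}(r_2-s)^{1-2\beta}\,dr_2
  &= (t-s)^{(4H-2+2\beta)+(1-2\beta)+1}\int_0^1 (1-x)^{4H-2+2\beta}\,x^{1-2\beta}\,dx \\
  &= B(2-2\beta,\,4H-1+2\beta)\,(t-s)^{4H} .
\end{align*}
```

The Beta function is finite because $2-2\beta > 0$ and $4H-1+2\beta > 0$ under the stated assumptions.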

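To bound $g^{(4)}_{s,t}$ we perform the change of variables $r_1 \mapsto s - r_1$ and $r_2 \mapsto r_2 + s$, so that

\begin{align*}
\int_s^t\!\int_0^s |g^{(4)}_{s,t}(r_1,r_2)|^2\,dr_1\,dr_2 &= M\int_0^{t-s} (t-s-r_2)^{2H-1} \\
&\quad \times \int_0^s \bigl(2(r_2+r_1)^{H-\frac12} - r_1^{H-\frac12} - (t-s+r_1)^{H-\frac12}\bigr)^2\,dr_1\,dr_2 .
\end{align*}

We then dilate the expression by $t - s$, showing that it is proportional to

\[ |t-s|^{4H}\int_0^1\!\int_0^{s/(t-s)} (1-r_2)^{2H-1}\bigl(2(r_2+r_1)^{H-\frac12} - r_1^{H-\frac12} - (1+r_1)^{H-\frac12}\bigr)^2\,dr_1\,dr_2 . \]

It is straightforward to check that this integral converges for all $H \in (0,1)$, even when the inner upper limit is replaced by $\infty$, which shows the requested bound on $g^{(4)}_{s,t}$.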

Finally, to bound $g^{(5)}_{s,t}$, we perform the change of variables $r_1 \mapsto s - r_1$ and $r_2 \mapsto s - r_2$, followed by a dilatation by $t - s$, so that

f r \9f, ) t(ri,r 2 )\ 2 dr 1 dr 2 = \t-s\ 4H /*"* [ ^ \g (5 \ ri ,r 2 )\ 2 dn dr 2 , 

Jo Jo Jo Jr 2 

where 

/0 ITJ 

Note now that, for every (3 G (0, | — H), there exists a constant M such that, for 
ri > r 2 , one has the bound 

/0 / _ n-/3 2fl-l+ / 8 

(r 2 - «) 2ff - 2 +^ rf w < ikf -— i — — — — dv . 

-i 1 + r 2 

Choosing /? ~ ^ (but slightly larger than i) for r x > r 2 + 1 and /3 = for r x < r 2 , 
we can check that 

/>OG r-OO 

/ / \9 (5 \ri,r 2 )\ 2 dndr 2 < oo , 
so that the claim follows. □